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The  problem  of  finding  the  time-optimal  control  of  the  systems 
x = Alt(t)x  + A12(t)z  + B1(t)u 
pa  = A21(t)x  + A22(t)z  + B2(t)u 

and 

x * f<x,t)  + F(x,t)z  + B^(x,t)u 
M>z  = g(x,t)  + G(x,t)z  + B2(x,t)u 

is  treated  where  x £ Rn,  z € Rm,  p 0 is  a small  positive  parameter  and 
u 6 R is  constrained.  The  time-optimal  control  of  these  systems  is  shown 
to  possess  a two  time-scale  property  when  the  "fast"  state  z is  stable. 
This  property  is  that  the  optimal  control  is  composed  of  a control  in  a 
slow  time-scale  followed  by  a control  time  in  a fast  time-scale.  The 
"slow"  control  is  primarily  concerned  with  steering  the  "slow"  state  x. 
Based  on  the  two  time-scale  property  a near  optimal  control  is  presented 
which  can  be  calculated  on  reduced  order  time-optimal  control  problems  of 
order  n and  m.  Some  new  stability  bound  results  and  some  examples 
illustrating  the  near  optimal  control  are  presented. 
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1.  INTRODUCTION 


I. 1 Problem  Description 

Control  problems  for  singularly  perturbed  systems  with  unconstrained 
control  have  been  given  considerable  attention  and  major  results  have  been 
obtained  [l],  For  these  problems  the  concept  of  time  scale  decomposition 
has  been  developed  providing  a separation  of  slow  and  fast  dynamics  and 
a reduction  in  problem  order. 

More  recently,  results  [2-4]  have  been  obtained  for  the  time-optimal 
control  of  linear  time- invariant  singularly  perturbed  systems.  For  these 
problems,  time  scale  decomposition  again  implies  a separation  of  slow  and 
fast  dynamics  and  a reduction  of  problem  order.  Due  to  the  control 
constraint  invariably  present  in  time-optimal  problems,  the  control  is 
characterized  by  a slow  control,  primarily  dependent  on  slow  dynamics, 
followed  by  a fast  control  primarily  dependent  on  fast  dynamics.  This  is 
referred  to  as  the  two  time-scale  property.  The  implication  of  this 
property  is  that  the  time-optimal  control  of  systems  with  slow  and  fast 
states  should  first  concentrate  on  steering  the  slow  states  near  to  their 
desired  final  state.  Then  the  control  should  concentrate  on  rapidly 
steering  the  fast  states  to  their  desired  final  state  while  steering  the 
slow  states  the  remaining  small  distance  to  their  final  state. 

The  time-optimal  problem  treated  here  is  that  of  finding  the  control 
which  steers  the  slow  and  fast  states  of  a singularly  perturbed  system 
from  a fixed  initial  point  to  a fixed  final  point  in  minimum  time.  A 
major  concern  of  this  thesis  is  the  extension  of  the  classes  of  systems, 
for  which  the  two  time-scale  property  of  the  time-optimal  control  of 
singularly  perturbed  systems  has  been  revealed,  to  include  linear  time- 
varying  systems  and  a class  of  nonlinear  systems.  Thir  is  done  by  taking 


T 
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expansions  in  the  singular  perturbation  parameter  p,  of  the  necessary 
conditions  provided  by  the  minimum  principle.  Once  the  two  time-scale 
property  is  revealed,  two  systems  of  reduced  order  are  defined.  These 
systems  are  referred  to  as  the  reduced  order  system,  or  slow  subsystem, 
and  the  fast  subsystem.  It  is  computationally  simpler  to  solve  control 

problems  for  these  systems  due  to  the  order  reduction  and  the  removal  of 

j 

the  full  order  system's  stiffness  associated  with  the  singular  perturbation 
parameter  p.  The  purpose  in  revealing  the  two  time-scale  property  is  to 
make  it  possible  to  demonstrate  that  a near  time-optimal  control  can  be 
constructed,  for  singularly  perturbed  linear  time-varying  systems  and  a 
class  of  nonlinear  systems,  by  concatenating  a time-optimal  control  for 
the  reduced  order  system  and  a time-optimal  control  for  the  fast  sub- 
system. 

One  method  often  used  in  improving  the  near-optimal  controls  which 
, are  developed  for  singularly  perturbed  systems  is  asymptotic  expansions 

in  the  singular  perturbation  parameter  p.  In  this  thesis  another  method 
is  presented  for  time-optimal  problems.  This  method  is  called  the  inter- 
mediate point  algorithm  and  was  originally  developed  !_5,6j  for  the  time- 
optimal  control  of  linear  time- invariant  systems.  Originally  the 
algorithm  was  designed  for  singularly  perturbed  systems  which  are  block 
diagonalized.  Since  it  is  often  difficult  to  block  diagonalize  nonlinear 
systems,  the  intermediate  point  algorithm  is  altered  for  the  treatment  of 
some  nonlinear  examples. 

This  thesis  also  contains  some  new  stability  results  for  linear  time- 
varying  singularly  perturbed  systems.  Previously  [7,8],  it  was  shown  via 
Liapunov  functions  that  for  p small  enough,  the  stability  of  two  subsystems 
is  sufficient  for  the  stability  of  a full  order  system.  The  main  thrust 


of  the  results  presented  here  is  the  discovery  of  upper  bounds  for  the 
singular  perturbation  parameter  y,  such  that  for  p,  smaller  than  these 
bounds  the  systems  under  consideration  are  uniformly  asymptotically 
stable . 

1.2  Research  in  Time-Optimal  Control 

According  to  Fel'dbaum  one  of  the  earliest  theoretical  papers  on 
time-optimal  control  was  published  in  1949  [9],  In  [10]  Fel’dbaum 
presents  the  phase  plane  approach  to  time-optimal  control.  It  appears  to 
be  in  response  to  the  difficulties  encountered  in  finding  necessary 
conditions  for  the  optimality  of  constrained  control  problems,  such  as 
the  time-optimal  control  problem,  that  the  maximum  principle  was  developed 
by  Pontryagon,  et  al.  [ll].  In  [12]  Boltyanskii  has  produced  a highly 
readable  book  in  which  a proof  of  the  maximum  principle  is  given  in  terms 
of  the  time-optimal  problem.  From  this  proof  he  generalizes  to  treat 
other  problems.  Thus  one  might  say  that  time-optimal  control  can  be 
looked  at  as  being  at  the  base  of  the  maximum  principle. 

At  the  same  time  as  the  above  work  was  going  on,  Neustadt,  Lasalle 
and  Hermes  was  treating  time-optimal  problems  in  the  United  States.  In 
1952  Bushaw  [39]  for  some  simple  systems  demonstrated  that  a bang-bang 
control  was  time-optimal.  In  1959  Lasalle  [13]  presented  the  proof  of 
the  bang-bang  principle  which  in  essence  states  that  for  linear  time- 
varying  systems  if  there  exists  a time-optimal  control,  there  exists  a 
bang-bang  time-optimal  control.  A bang-bang  control  is  one  which  utilizes 
all  the  control  available  by  only  taking  values  on  the  edge  of  the 
control  constraint.  This  and  other  results  are  presented  in  [l4]. 
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Neustadt  and  Eaton  developed  an  iterative  method  for  determining  the 
initial  values  of  the  costates.  Neustadt' s work  treats  continuous  systems 
and  Eaton  considered  discretized  systems  [l2].  Plant  [ 15]  later  treated 
a number  of  iterative  methods  which  may  be  applied  to  time-optimal  control 
problems . 

A number  of  papers  have  been  published  which  consider  specific 
problems  or  systems  [16,17,18,19].  It  is  interesting  to  note  that  in 
C 18 ] the  time-optimal  control  for  a triple  integrator  is  calculated.  This 
control  law  was  implemented  in  the  lunar  module  of  an  Apollo  spacecraft 
and  performed  very  well.  In  [20]  the  sensitivities  in  cost  and  position 
to  small  changes  in  the  switching  times  of  bang-bang  controls  are  derived. 
Two  papers  which  consider  computational  techniques  for  finding  time- 
optimal  controls  are  of  interest  [21,22].  The  method  treated  in  [22]  is 
essentially  a switching  time  sensitivity  method.  Beginning  with  a guess 
for  the  time-optimal  control  a successive  approximation  for  changes  in 
switching  times  is  set  up  such  that  the  error  in  reaching  the  final  point 
is  decreased.  In  [2l],  a hill  climbing  method  is  applied  in  nested 
iterations  to  find  time-optimal  controls. 


1.3  Singular  Perturbation  Results 


The  singular  perturbation  approach  to  differential  equations  has  been 
used  for  some  time.  A long  list  of  such  references  is  contained  in  [23]. 
More  recently  singularly  perturbed  optimal  control  problems  have  been 
treated  and  a survey  of  major  results  is  contained  in  [l].  In  this  thesis 
two  major  areas  of  singular  perturbation  work  have  been  touched  on.  The 
first  area  is  covered  in  Chapter  2 and  deals  with  the  uniform  asymptotic 
stability  of  singularly  perturbed  linear  time- varying  systems  [7,8,24,25, 
26,27]. 
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The  rest  of  the  thesis  which  at  times  relies  on  the  results  con- 
tained in  Chapter  2 deals  with  the  development  of  decomposed  time-optimaJ 
control  for  singularly  perturbed  systems.  As  mentioned  earlier,  the 
initial  work  for  the  time-optimal  control  of  singularly  perturbed  system 
is  contained  in  [2,3,4].  In  [5,6]  an  iterative  method  which  is  based  on 
the  theory  in  [2,3,4]  is  developed.  This  method  computes  time-optimal 
controls  for  full  order  systems  from  the  solutions  to  reduced  order  time- 
optimal  control  problems.  It  is  called  the  intermediate  point  algorithm 
in  this  thesis  and  is  described  in  Chapter  4. 

A recent  paper  [28]  treats  the  problem  of  finding  the  time-optimal 
control  of  singularly  perturbed  time-varying  systems  which  are  nonlinear 
in  the  slow  state  and  linear  in  the  fast  state  and  control.  In  [29]  the 
feedback  time-optimal  control  of  the  reduced  order  system  of  a linear 
time- invariant  singularly  perturbed  system  is  treated.  There,  it  is 
shown  that  this  control  steers  the  slow  state  to  within  a compact  subset 
of  the  origin  (the  final  desired  point)  and  the  fast  state  to  some  point 
which  is  a finite  distance  from  the  origin.  Two  results  for  related 
problems  are  [30, 3l]  where  the  time-optimal  controls  of  systems  with 
regular  perturbations  are  treated. 

1.4  Chapter  Review 

Chapter  2 might  more  properly  be  called  Preliminaries.  It  contains 
the  derivation  of  bounds  for  singularly  perturbed  systems.  The  stability 
of  the  fast  subsystem  takes  on  added  importance  in  the  study  of  time- 
optimal  control.  It  is  shown  in  Chapter  3 that  the  stability  of  the  fast 
subsystem  is  a necessary  condition  for  the  two  time-scale  property  and 
therefore  these  bounds  guarantee  this  property.  In  the  next  section  of 


Chapter  2 an  example  of  the  application  of  these  bounds  is  presented 
followed  by  the  discussion  of  a stability  assumption  for  a class  of  non- 
linear singularly  perturbed  systems.  This  assumption  is  later  used  in 
the  development  of  the  two  time-scale  property  in  Chapter  5.  Then  the 
chapter  is  ended  with  a presentation  of  a block  diagonalization  trans- 
formation (used  in  Chapter  3)  and  some  lemmas  concerning  the  expansion 
of  two  common  integrals  via  integration  by  parts. 

Chapter  3 is  first  concerned  with  revealing  the  two  time-scale 
property  of  time-optimal  control  for  linear  time-varying  singularly 
perturbed  systems.  Then  a near  optimal  control  is  defined  based  on  this 
property. 

In  Chapter  4 the  intermediate  point  algorithm  is  defined  and  dis- 
cussed. Following  this  an  example  of  its  application  is  presented. 

Chapter  5 begins  by  showing  that  a class  of  nonlinear  systems 
possesses  the  two  time-scale  property  of  time-optimal  control.  Then  a 
near  optimal  control  for  these  systems  is  proposed  and  its  near  optimality 
proved. 

Nonlinear  examples  are  presented  in  Chapter  6.  One  of  these  examples 
is  not  within  the  classes  of  systems  treated  earlier  in  the  thesis.  How- 
ever, based  on  the  intuition  gained  during  the  earlier  work,  a near- 
optimal  control  it  found  for  these  systems  and  the  intermediate  point 
algorithm  is  applied  in  revised  form  to  improve  on  the  near  optimal 
control . 

Finally,  Chapter  7 contains  the  conclusions  and  possible  directions 


of  future  research. 
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1.5  Notation 

Vectors  and  scalars  are  generally  represented  by  lower  case  letters 

and  matrices  by  upper  case  letters.  The  derivative  with  respect  to  time 

• • 

of  a vector  x or  a matrix  A is  denoted  by  x or  A.  The  transpose  of  a 
matrix  A is  denoted  by  A'.  The  norm  of  a matrix  A is  written  |a|.  The 
norm  used  here  is 

I A | = (2  a..2)172 

where  the  a.  . are  the  elements  of  the  matrix  A.  For  the  norm  of  a scalar 
this  reduces  to  the  absolute  value.  A set  U composed  of  objects  u 
characterized  by  some  property  P is  defined  by 

U = {u:u  has  the  property  P]  . 

Throughout  this  thesis  a vector  f(x,t)  is  written  as  f in  cases  where  no 
confusion  will  result  from  dropping  the  dependence  on  x and  t. 
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2.  STABILITY  PROPERTIES 


2.1  Introduction 

In  this  chapter  some  bounds  on  p,  are  found  which  guarantee  the 
uniform  asymptotic  stability  of 

* 

x = A1]L(t)x  + A12(t)z  + B^  (t)u  (2.1a) 

V*  - A21(t)x  + A22(t)z  + B2(t)u  (2.1b) 

where  x £ Rn,  z £ Rm,  u £ Rr  and  p,  is  greater  than  zero.  The  following 
assumptions  are  made: 

(i) 

(ii) 

(iii) 


In  Lemmas  2.2  and  2.3  under  assumptions  (i)  and  (ii),  a bound  p.^  is 
found  such  that  for  p>  £ (0,p,q)  the  equilibrium  y = 0 of  the  fast  subsystem 


The  matrices  A„(t),  i,j  = 1,2,  are  bounded  and  have 
bounded  first  derivatives  for  all  t. 

The  eigenvalues  X^(t)  of  A22(t)  satisfy 


Re(X.(t))  < - v < 0 

for  all  t where  Y is  a constant. 
The  reduced  system 


(2.2) 


w * (Au(t)  - A12(t)A22_1(t)A21(t))W 
= AQ(t)w 

has  w * 0 as  the  uniformly  asymptotically  stable 
equilibrium. 


(2.3) 


p.y  - A22(t)y 


(2.4) 
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is  uniformly  asymptotically  stable.  Then  in  Lemma  2.4,  under  assumptions 
(i) , (ii)  and  (iii),  a bound  p^  is  found  such  that  for  p,  € (0,p^)  the 
equilibrium,  u = 0,  x = 0 and  z =>  0 of  the  full  order  system  (2.1)  is 
uniformly  asymptotically  stable.  Under  (i),  (ii)  and  (iii)  it  has  been 
known  for  some  time  [7,8]  that  (2.1)  is  uniformly  asymptotically  stable 
for  p.  small  enough.  In  [24]  it  is  shown  that  under  (i)  and  (ii),  (2.4) 
is  uniformly  asymptotically  stable.  The  new  results  here  are  the  bounds 
P’0  and  p^. 

After  these  stability  lemmas  have  been  stated  and  proved,  a 
stability  assumption  concerning  the  nonlinear  system 

x = f(x,t)  + F(x,t)z  + B^(x,t)u  (2.5a) 

Pz  = g(x,t)  + G(x,t)z  + B2(x,t)  u (2.5b) 

is  presented  and  discussed  in  preparation  for  Chapter  5 where  the  time- 
optimal  control  of  (2.5)  is  treated. 

Finally  a diagonalization  transformation  and  two  lemmas  are  pre- 
sented. In  these  lemmas  expansions  in  p.  are  presented  for  two  integrals 
which  appear  often  in  the  development  in  this  thesis. 

The  stability  lemmas  for  systems  (2.1)  and  (2.4)  are  presented  in 
Section  2.2  and  an  example  illustrating  the  use  of  these  lemmas  is  pre- 
sented in  Section  2.3.  Section  2.4  contains  the  discussion  of  a stability 
assumption  for  system  (2.5).  Section  2.5  contains  the  diagonalization 
transformation  and  in  Section  2.6  expansions  in  p.  are  derived  for  two 
integrals . 
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2.2  The  Stability  of  Linear  Time-Varying  Systems 

In  the  presentation  of  this  section  we  will  need  the  well-known 
lemma  of  Gronwall. 

Lemma  2.1  (Gronwall' s Lemma  [32]).  Let  X(t)  be  a real  continuous 
function  and  Y(t)  a non-negative  continuous  function  on  the  interval 
[t0,ti3.  If  a continuous  function  y(t)  has  the  property  that 

t 

y(t)  < Mt)  + J Y(s)y (s)ds 


for  t^  < t < t^,  then  on  the  same  interval 

t t 

y(t)  < ^(t)  + J X(s)Y(s)exp(J  Y(t)dt)ds  . 

*0 

The  proof  of  this  lemma  is  contained  in  [32]. 

In  the  next  lemma  a bound  p,  on  p.  is  found  to  guarantee  that  the 
equilibrium  y = 0 of  system  (2.4)  is  uniformly  asymptotically  stable  under 
assumptions  (i)  and  (ii).  Assumption  (i)  implies 


1^22  (t)  ” ^22  * — 3(t-tg) 


(2.6) 


where  0 is  a constant  equal  to  the  maximum  of  | for  all  t by  the 

mean  value  theorem.  Also  for  t > tQ  there  exists  a K such  that 


(tQ) 


(2.7) 


when  (ii)  is  satisfied  _ 3 8 i . 


Let  $22*'t»t:o^  be  tbe  state  transition  matrix  of  (2.4)  and  define 


9(t,t0) 


<P(t,t0)  t *22(t,t0)  " 6 


A22(t0)  I g 


(2.8) 


Lemma  2.2.  Let  (i)  and  (ii)  be  satisfied  and  for  any  a € (0,Y), 
let  g = a2/8K.  Then  for  g € (0,g),  cp(t,tn)  satisfies 


'(’(W  = 0 


(2.9) 


l«P(t,t0)  1 < 2 e~g 

e (a  -gK9) 


(2.10) 


where  a = Y - a > 0. 


Proof  of  Lemma  2.2.  The  definition  of  9(t,tQ)  implies  (2.9)  and 


(2.11), 


• A22^^  A22^b^  ” A22^b0^  A22^t0^  u.  I 

9(t,t0)  *-21_9(t,t0)  +-2S --2-2-a-  e 22  0 /.  (2.11) 


Applying  the  variation  of  constants  formula  to  (2.11)  we  obtain 


i » A22(to)  nr 

«*.*«>>  "u-f  e22  /(A22(t)  - A22(t0))e 

C0 


A22(V  g 


i t A22(to>hr 

+ Z J 'PU,  t)  (A22(t)  * A22^C0^e 


(2.12) 


a((t-tQ)/p) 

Let  Y = a + C,  multiply  (2.12)  through  by  e and  let 


* 

I 


1Kt,t0)  - e 


cr((t-t0)/y.) 


to  yield 


net 


x*.  °^r)  A22(t0)i'vl 

t ) = — e 1 1 e 

»c0'  vi  J 


X (A2£(t)  “ A22^0^e 


A22(t 


,p 

0;\  p 


dT 


T-t, 


+ 1 J TKt,T)e  ' **  /(A22(t)  - A22(tQ))e 


A22^C0^ 


We 


next  construct  the  successive  approximation 


n(k+1) (t 


t).ij  .°P)  >(to,M 

»t0;  p J 


. A22(t0) I H , dT 

X (A22(t)  - A22(tQ))e  ' dT 


T-t, 


+ -J  Tl(k)(t,T)e 
“ t 

0 


T-t, 


X (A22(T)  - A22(tQ))e 


A22(t0M  p 


dT 


with  initial  guess  T\<°>  Ct,tQ)  = 0.  Substituting  (2.6)  and  (2. 
(2.13)  and  integrating  for  T](1)(t,t0)  we  obtain 

|T](1)(t,t0)!  < ^ e 1 **  ^ \-r)  -2J~2 


(2.1 


7)  into 


13 


we  have  by  induction 


lH(k+1)(t,t  ) - >|  <2  l^f 

u l a a e / 


Define  p = Since 

a 


|H<k)(t.t0)  - T1(0)(t,t  )|  < E |Tl(j)(t,t0)  - Tl(j_1)(t,t0)| 


J-l 


< 2(pk_1  + ...  + p + 1) 


pk29 


2 2 
a e 


= 2 


/ 1 - ok  \ uK28 


1-p  2 2 

F 1 a e 


then  for  p<loru<ct/BK 


lim  |Tl (t , t_)  | = lim  |T)  (tjtg)  “ (c>tg)  I 
k — ® k “*  ® 


dK29 


1 2 UK  3 = 2 

1-p)  a2e2  e2(a2-uSK) 


Thus  for  u < a2/9K  the  successive  approximation  (2.13)  converges  to  a 
solution  which  satisfies 


ln<c,t)l<2  2*£— 

0 e2(aZ-uBK) 


cp(t,tQ)  = Ti(t,t0) 


and  therefore 


|«(t,t0) 1 


< -2K23 

“ e2(a2-u8K) 


(2. 


15 


This  completes  the  proof  of  Lemma  2.2. 

This  lemma  implies  that  for  p.  € (0,p.)  the  fast  subsystem  (2.4)  is 
uniformly  asymptotically  stable  for  any  a € (0, Y)  where  -Y  is  the  constant 
upper  bound  on  the  real  parts  of  the  eigenvalues  of  A22(t)*  A consequence 
of  the  proof  of  this  lemma  is  equation  (2.10)  which  provides  a bound  on 
the  error  incurred  in  approximating  the  state  transition  matrix  ^22^t,t:0^ 
of  (2.4)  by  the  transition  matrix 

A22(t0)  IT- 

e 

of  the  time- invariant  system 

wy  = A22(t0)y  . (2.16) 

As  can  be  seen  this  error  is  0(u)  and  exponentially  decaying  with  an 
0(p.)  time  constant. 

Lemma  2.3.  Let  assumptions  (i)  and  (ii)  be  satisfied  and  set 
2 

Uq  * Y /£K.  Then  for  p.  € (0,Uq)  system  (2.4)  is  uniformly  asymptotically 
stable. 

Proof  of  Lemma  2.3.  Define  Pq  = Y /SK  and  a = / pSK  + Y/2  where 

p,  € (0,p,p).  Then  a = Y - Cl  > 0 and  for  p.  € (0,p.g)  equation  (2.15)  implies 

that  (2.4)  is  uniformly  asymptotically  stable  since  the  definition  of  c. 

2 

implies  that  a - pKS  is  never  equal  to  zero.  The  proof  is  finished. 

] 

In  Lemma  2.4  it  is  assumed  that  (iii)  is  satisfied  and  the  fast  sub- 
system  (2.4)  is  uniformly  asymptotically  stable.  Under  these  assumptions 


the  bound  p^  is  found  such  that  for  p.  5 (0,p^)  the  equilibrium  u = 0, 
x * 0 and  z = 0 of  (2.1)  is  uniformly  asymptotically  stable.  Then  in 
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Lemma  2.5  the  bounds  of  Lemmas  2.3  and  2.4  are  combined  to  find  the 

* * 
bound  |i  such  that  under  (i),  (ii)  and  (iii)  and  for  p € (0,  p,  ) the 

above  equilibrium  is  uniformly  asymptotically  stable. 

In  order  to  simplify  the  statement  and  proof  of  the  lemma,  the 

transformed  system 


x = AQ(t)x  + A12  (t)T) 


MJ1  = H(L(t)  + L(t)AQ(t))x  + A22  (t)T]  + ML(t)A12(t)71 


is  treated  which  is  the  result  of  applying  the  transformation 


(2.17) 


T]  = z + A22(t)  A21(t)x  t z + L(t)x 


(2.18) 


to  (2.1)  with  u set  equal  to  zero.  Here  Aq  is  as  defined  in  (2.3). 
Clearly  if  (2.17)  is  uniformly  asymptotically  then  the  equilibrium  u = 0, 
x = 0 and  z = 0 of  (2.1)  is  uniformly  asymptotically  stable. 

Lemma  2.4.  Let  (2.3)  and  (2.4)  be  uniformly  asymptotically  stable 
systems  so  that  their  state  transition  matrices  satisfy  (2.19)  and  (2.20) 
respectively 


Ity^Vl  -Ki 


-al(t-t0) 


l^22^'t,t0^  ' - K2e 


V c ^ co 


(2.19) 


(2.20) 


If  constants  M2  and  exist  such  that  for  all  t 
|A12Ct)  l < Mi,  |L(t)A12(t)  | <M2, 


|L(t)  L(t)AQ(t)  | <M3  , 


(2.21) 
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then  for  all  y.  € (0,y^),  where 


^1  = a1K2M2  + k1H1k2m3 


the  equilibrium  u = 0,  x = 0 and  z = 0 of  (2.1)  is  uniformly  asymptotically 


stable. 


Proof  of  Lemma  2.4.  Applying  the  variation-of-constants  formula  to 


(2.17)  yields 


x(t)  = §0(t,t0)xQ  + J §0(t,t)A12(T)71(T)dT 


TKt)  = $22(t,tQ)ll0  + J «22(t,T)L(T)A12(T)71(T)dT 


+ J $22(t,T)(L(T)  + L(T)A0(T))x(T)dT 

co 


where 


= 20  + A22  ^0^21 


The  bounds  of  equations  (2.19),  (2.20)  and  (2.21)  imply 


-o  (t-t  ) c -a  (t-T) 

|x(t)  1 < Kxe  1 )x0|+J  Kxe  M1|'^(T)|dT  (2.22) 


j 
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f V 


|TKt)  | < K2e 


-cr„ 


t-t. 


2 n 


-o„ 


1^0  1 + J K2e  \ ^ /M2|Tl(T)|dT 


t-T 

Vi 


(2.23) 


r ”°2'  a 

*1  V * 

t 

0 


M_  Jx(t)  [dT  . 


In  this  proof  we  apply  Gronwall's  Lemma  to  (2.23)  and  then  to  (2.22)  to 
derive  an  upper  bound  such  that  for  y,  6 (0,y^),  the  absolute  values 
|x(t)  | and  |T) (t ) | are  bounded  by  a decreasing  exponential.  Letting 


<?2t/vi 


w(t)  = e |H(t)  | in  equation  (2.23)  yields 


°2V>\_  , . r 


W(t)  <K2e  iTl0l  + J*  K2e  * Mj  |x(t)  |d T 


+ j K2M2w(T)dT 


Applying  Gronwall's  Lemma  and  integrating,  we  obtain 


a,tn/|j.  K,M.(t-t.) 
20  |T|0|.  22  0 


t K M?(t-T)  a T/n 
+ J e X2e  ^ M3|x(T)ldT 

fc0 


which  yields 


-a  (t-t  ) *■  -<Vt-r) 

8 0 l I -3  I, 


lT)(t)  ] < K2e  ° !^10!+J  K2M3e  |x(T)  |dT 


where  a3  = ~2/u  - K2M2-  In  the  following  we  will  need  a3  > 0. 


(2.24) 


j 


Substituting  (2.24)  into  (2.22)  yields 


|x(t)  1 < Kxe 


-a  (t-t  ) t -a  (t-T) 

1*0 1 + J v 

c0 


MlK2 l”o 


-o3<T-t0) 


dT 


c -a  (t-T)  t 
+ J Kie  M1(J 

C0  tn 


-a-(t-s) 

K„M„e  |x(s)  |ds)dT 


which  implies 


:(t)|  < 


h IxqI  ' 


K. 


1M1K2  lT'0 


a3-°l 


-ai(t-t0) 


a3-Gl 


K1M1K2M3  r -G3(t_T) 


I 


t. 


x(t)  |dr  . 


V 


Let  y(t)  = e |x(t)  |,  apply  Gronwall's  Lemma  and  integrate  to 
obtain 


|x(t)  1 < 


W2  1*0 


a3'ai 


1 ~ 


K1M1K2M3 


-V£-V 


K.  x 


W2 1% 


1 1 0 1 a3"ai 


i 

(1  + i)e 


'K1M1K2M2 

— ’ °r 


aiKI  1 L , w^Vil 


M3*W  a 


K. 


1 I 


a3“°l 


where 


K.  M.K«M_ 


Thus  for  (2.17)  to  be  uniformly  asymptotically  stable  we  need  in- 
equalities (2.25)  and  (2.26)  to  be  satisfied, 


a 


3 


(2.25) 


KM  KM 

a3  - -4^  > o . 


(2.26) 


Let 


* ii 

T.  a,  K„M„  + K.M,K„M. 


1 '2  2 


ri2u3 


If  |i  € (0,p^),  inequalities  (2.25)  and  (2.26)  are  satisfied  and  therefore 
(2.17)  is  uniformly  asymptotically  stable  which  implies  that  the 
equilibrium  u = 0,  x = 0 and  z = 0 as  the  full  order  system  (2.1)  is 
uniformly  asymptotically  stable.  Lemma  2.4  is  proved. 

The  corollary  follows  directly  from  Lemmas  2.3  and  2.4. 

Corollary  2.5.  Let  p,  = min(u.g,  p^) . Assumptions  (i),  (ii)  and 
(iii)  guarantee  that  for  p.  € (0,p,  ),  the  equilibrium  u = 0,  x = 0 and 
z = 0 of  (2.1)  is  uniformly  asymptotically  stable. 


2.3  Example 

The  example  illustrates  the  calculation  of  the  stability  bounds  for 
p.  derived  in  the  last  section  for  the  system 


-4  + cos  t 


-1  + 1.1  cos  t 


1-1.1  sin  t cos  t 


-1  = 1.1  sin  t cos  t -1  +-  1.1  sin  t 


The  reduced  system  is 


(2.27) 


— 2 — 
x = (-3.474  + cos  t - 1.222  sin  t)x 


(2.28) 


and  the  fast  subsystem  is 


“1  + 1.1  cos  t 1 - 1.1  sin  t cos  t 

M'T]  = 2 • 

-1  - 1.1  sin  t cos  t -1  + 1.1  sin  t 


(2.29) 


When  (i  = 1 an  unstable  fundamental  solution  of  (2.29)  is 


$2(t,0)  = 


•It  -t  . 

e cos  t e sm  t 


.It  . „ -t 

■e  sm  t e cos  t 


even  though  the  eigenvalues  of  A22(0  have  real  parts  = -0.45  for  all 
t [33,  p.  147].  Since  system  (2.27)  satisfies  (i)  and  (ii)  and  system 
(2.28)  is  uniformly  asymptotically  stable,  we  know  that  for  p.  sufficiently 
small,  both  systems  (2.29)  and  (2.27)  are  uniformly  asymptotically  stable. 

Fixing  the  coefficients  of  the  fast  subsystem  at  any  t = t^,  we 
obtain  the  linear  time-invariant  system 


-1  + 1.1  cos  t. 


1 + 1.1  sin  tg  cos  tgl 


-1  - 1.1  sin  tg  cos  Cq  -1  + 1.1  sin  t^ 


1 . (2.30) 


r 


22 


The  state  transition  matrix  for  (2.30)  is 


A22(t0)T 


-.45t 


aH(t0)cos (•835T-6n(t0) ) OL2(t0)sin(.835T) 
'a21(t0)sin(-835T)  a22(t0)cos(.835r-622(t0))J 


where  T 


aLl(t0)  = <1-377  - L-617  sin2t0  + 1.734  sin4tQ)1/2  , 
al2^t0^  = (1#^8  " 1*317  sin  tQ  cos  tQ)  , 


a21(t0^  = (-1*198  - 1.317  sin  tQ  cos  tQ)  , 

°22(t0)  = (1-377  - 1-617  cos2t0  + 1.734  cos4t0)1/2  , 

6n(t0)  = tan  ^■(.eu  - 1.317  sin2tQ)  , 

^22^C0^  = tan  ^(*8^  " 1.317  cos2t0)  . 

2 1/2 

Using  as  a norm  (La_  (£q))  we  find  that  K = 7.358  and  V = 0.45. 
Correspondingly  we  find  the  max  |A22(t)  | * 1.555  = 8.  The  values  of  8,  K 
and  Y and  Lemma  2.3  imply  that  ^ = .0177  and  that  for  0 < p.  < u^,  system 
(2.29)  is  uniformly  asymptotically  stable. 

Since  is  found  by  taking  various  matrix  norms  it  is  conservative. 
In  a computer  simulation  the  fast  subsystem  (2.29)  was  numerically 
integrated  for  the  initial  condition  z^(0)  » z2(0)  = 1 for  various  values 
of  u.  It  was  found  that  for  p.  < .65,  the  solution  was  exponentially 
decaying.  In  Figures  2.1,  2.2  and  2.3  the  solution  for  is  shown  for  p, 
equal  0.1,  0.4  and  0.7  respectively.  From  Figures  2.1  and  2.2  it  can  be 
seen  as  expected,  that  for  smaller  p,  the  exponential  decay  is  more  rapid. 
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We  next  find  a bound  for  the  stability  of  the  full  order  system 
(2.27).  From  Lemma  2.3  we  obtain 


^22^t,C(P  1 - K 


1 + 2 


PKB 


e2(a2-pK0) 


-Ct/g. 


(2.31) 


If  we  let  a = a = V/2  we  obtain  a value  for  p,  of  .00442.  For  p.  € (0,p.) 
we  may  use  the  bounds  of  (2.31)  for  ^22^t,t:o)*  Thus 


K 


2 


K 


2 pice 

e2(a2-pxe)_ 


and  a 2 s G ~ *225.  From  equation  (2.28) 

-1.89(t-tn) 

1*^)1  < |*0  |e 


which  yields  = 1 and  = 1.89. 

Values  for  M^,  and  are  1,  1.956  and  7.09  respectively.  Sub- 
stituting these  values  into 


< CT1G2 
* ~ + K.  M.  K_M. 


12  2 


112  3 


yields  p^  = .00317.  Since  p,  < p.  we  know  from  Corollary  2.5  that  for 
p,  € (0,p^)  system  (2.27)  is  uniformly  asymptotically  stable. 

This  example  illustrates  the  use  of  Lemmas  2.2,  2.3,  2.4  and 
Corollary  2.5  in  obtaining  stability  bounds  of  p,  in  system  (2.27).  The 
bounds  Kj  and  O are  direct  results  of  Lemma  2.2,  thus  making  it  unnecessary 
to  determine  the  state  transition  matrix  for  the  fast  subsystem  directly. 
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2.4  A Stability  Assumption 


In  Chapter  5 the  time-optimal  control  of  nonlinear  system  (2.5)  is 
treated  where  x € Rn,  z € Rm,  u € Rr  and  p,  is  greater  than  zero.  The 
purpose  of  this  section  is  to  discuss  the  acceptability  of  stability 
assumption  which  is  made  in  Chapter  5 concerning  system  (2.5). 

We  assume  that  the  Jacobians  g (x,t)  and  (B-(x,t)u)  and  the  partial 

a SB2  X X 
derivatives  and  exist  and  are  continuous  with  respect  to  x and  t. 

The  controls  under  consideration  satisfy  the  constraint 


u € U = {u:  |ui | < 1,  i = 1, . . . ,r} 


and  their  components  u^  are  piecewise  constant  with  a finite  number  p of 

discontinuities  at  the  instants  t . where 

J 


C0  <C1  <C2  46  — <Cp  * 


That  is,  at  the  instants  t^  at  least  one  of  the  components  of  u is 
switching  from  one  constant  value  to  another  constant  value.  Suppose  we 
choose  an  arbitrary  control  u(t)  satisfying  the  above,  and  apply  it  to 
(2.5)  to  create  the  trajectory  x(t),z(t).  It  is  assumed  that  for  x(t) 
the  inverse  of  G(x,t)  ^ exists  for  t € 


The  stability  assumption  to  be  discussed  in  this  section  may  be 


stated: 


(i)  The  equilibrium  w = 0 of  the  system 


pw  = G(x(t),t)w 


(2.32) 


is  uniformly  asymptotically  stable,  for  p,  small 


enough. 


The  approach  used  in  the  application  of  singular  perturbation 
methods  to  approximate  the  solution  to  (2.5)  with  control  input  u(t)  is 
to  set  n = 0 in  (2.5b)  to  yield 

z * - G(x,t)  Lg(x,t)  - G(x,t)  1B2(x,t)u  (2.33) 

where  the  bar  (t)  denotes  p = 0.  Substituting  (2.35)  in  (2.5a)  yields 
the  reduced  order  system 

x = f0(x,t)  + BQ(x,t)u 

£ f(x,t)  - F (x,t)G(x, t)  Lg(x,t)  (2.34) 

+ (B1(x,t)  - F(x,t)G(x,t)  1B2(x,t))u  . 

One  assumption  [26]  which  will  guarantee  that  the  solution  x(t)  of 
(2.34)  with  input  u(t)  provides  an  approximation  for  x(t)  is  that  the 
equilibrium  z of  system  (2.5b)  is  uniformly  asymptotically  stable.  We 
will  show  that  the  assumption  (i)  above  is  equivalent. 

Consider  x(t)  and  u(t)  as  inputs  to  (2.5b).  Then  (2.5b)  may  be 
treated  as  the  linear  time-varying  system 

pz  = G(t)z  + g(t)  + B2(t)u(t)  (2.35) 

which  has  the  solution 

t 

Z(t)  = §u(t,t0)20  j ^(t,T)g(T)dT 

tQ  (2.36) 

t 

+ 7 J §^(t,T)B2(T)G(T)dT  . 

" *0 


t •• 


29 


Along  any  control  interval  (t^t^)  during  which  u(t)  = K where  K 
is  a vector  of  constants,  we  may  integrate  (2.36)  by  parts  to  yield 


- G(t)_1i(t)  + $p(t,ti)G(t.)‘1g(t.) 
t 

+ J*  ^ (t,T)[G(T)'1g(T)  + G(T)_1g(T)]dT  (2.37) 

t. 

1 

+ G(t)_1B2(t)K  + §ti(t,ti)G(t.)‘1B2(t.)K 

t A i« 

+ J $^(t,T)[G(T)'iB2(T)K  + G(T)“1B2(r)K]dr 
ti 

where  z±  is  the  value  of  z at  t±.  Since  g(t)  = g(x(t),t),  Bz(t)  = B2(x(t),t) 
and  G(t)  = G(x(t),t),  (2.37)  may  be  written 

z(t)  = $ti(t,t.)[z.  + G(^(ti),t.)"1(g(5(ti),ti)  + B2(x(t.),t.)K)] 

- G(x(t),t)  1 g (x ( t ) , t ) - G(x(t),t)“1B2(x(t),t)K 

t ^ (2.38) 

+ J * (t,T)[G(T)“l(g(r)  + B2(t)K) 

t . 

X 

+ G(T)_1(g(T)  + B2(T)K)]dT  . 

Suppose  that  $ (t,t.)  satisfies 
U»  2. 

-a|— ) 

l^(t,ti)|fKe  ^ 


(2.39) 
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then  the  integral  in  equation  (2.38)  is  0(p.)  and  the  first  term  will 
exponentially  decay  with  0(|i)  time  constant.  This  exponential  decay  is 
referred  to  as  boundary  layer  phenomena  and  occurs  at  each  switching  of 
the  control  u(t).  The  remaining  (middle)  term  in  equation  (2.38)  is 
simply  the  equilibrium  we  wish  to  be  uniformly  asymptotically  stable. 

Assuming  that  (2.39)  is  satisfied  is  the  same  as  the  above  assumption. 
It  has  been  shown  that  under  this  assumption 

z(t)  = z(t)  + 0(|i)  + boundary  layer  terms  . 

Therefore  substituting  z(t)  into  (5.1a)  to  yield  the  reduced  order  system 
will  result  in  x(t)  being  an  approximation  of  x(t). 

2.5  A Diagonalization  Transformation 


In  this  section  system  (2.1)  and  its  adjoint  system 
p = - Au(t)p  - £ A21(t)q 
4 = - A12(t)p  - i A22(t)q 


(2.40) 


will  be  lower  block  diagonalized  and  upper  block  diagonalized  respectively 
using  the  transformation  defined  by 


X " 

' In 

X 

_C  _ 

0 I 

- m J 

z 

L J 

’ I 0 * 

n 

w 

p 

T1 

W J 

uF(t)'  I 

u m. 

q 

— m 

(2.41a) 


(2.41b) 


This  diagonalization  will  simplify  the  process  of  revealing  the  main 
results  of  the  next  chapter.  When  (2.41a)  is  applied  to  (2.1)  and  (2.41b) 
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to  (2.5)  the  result  is 

X = (Au  - FA21)X  + GC  + (Bl  - FB2)u 
- a21x  + (a22  + MA21F)  c + b2u 

and 

% - - (Au  - FA21)  ? - “ A21T1 
7, . . o'i  - -1  <A22  + ^21F)'n 

where 


(2.42) 


(2.43) 


G(t)/  = A{2  - A^F ' - pF'  + 


^An 


pF'A^F' 


(2.44) 


and  (?  ',T\')  are  the  adjoint  variables  corresponding  to  (X'jGO  • In  the 
work  presented  in  this  thesis  the  problem  under  consideration  is  that  of 
time-optimal  control.  In  order  for  this  problem  to  have  meaning  the  time 
required  for  the  control  must  be  finite.  Therefore  we  only  consider 
transforming  systems  (2.1)  and  (2.40)  on  the  finite  interval  [tg,t^]. 

Chang  [34]  has  shown  that  under  the  assumption  that  the  eigenvalues 
of  A22(t)  have  real  parts  strictly  less  than  zero,  there  exists  a bounded 
solution  F(t,p.)/  to  G(t)y  = 0.  He  further  shows  that  one  solution 
G (t)  * = 0 is 

F(t,p,)'  = A'2_1A'2  + i»(t,n)  (2.45) 

where  W(t,u.)  can  be  shown  to  be  bounded  on  Ctg,t^].  Substituting  (2.45) 
into  (2.42)  and  (2.43)  yields 


x = AL(t)X  + (B0(t)  - pM(t,ii)B2(t))a 


HC  - A21(t)X  + A2(t)C  + B2(t)u 
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(2.46) 


and 


where 


§ - - Ax  (t)§  - ~ A21  (t)T| 
T1  - - ^ A2  (t)Tl 


A1  = A0  ^A21  * 

A2  = A22  + UA21A12A22  + y,  A2]_W  , 


(2.47) 


(2.48) 


W £ W(t,y)  , 


-1. 


and 


B0  ~ B1  " A12A22  B2 


A0  = A11  " A12A22  A21  * 


Thus  (2.46)  is  lower  block  diagonal  and  (2.47)  is  upper  block  diagonal. 


2.6  Expansions  of  Two  Integrals 


In  this  section  the  two  integrals 


ll  * J*  522(t,T)N(T)dT  t € CtQ, tx] 


(2.49) 


and 


I2  * J N(s)§22(s,t0)ds  t € [t(J,t13 

t„ 


(2.50) 


( 
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3 

are  expanded  as  series  to  0(p.  ) where  $22(t’t(p  is  the  state  transition 
matrix  of  the  fast  subsystem  (2.4).  The  following  assumptions  are  made: 
(i)  The  matrices  A 22 (c)  anc*  N(t)  are  continuous  and 
have  continuous  derivatives  on  [t^.t^]  to  any 
desired  order. 

(ii)  The  eigenvalues  ^(t)  of  A22(t)  satisfy 

Re(Xi(t))  < -V  < 0 t € [tQ.tj] 

where  V is  constant. 

Lemma  2.6.  Under  (i)  and  (ii)  the  integral  I may  be 
expanded  as 

h = -^22"1(t)N(t)  + U$22(t,t0)A22"1(t0)N(t0) 

" ^2A22"1(t)A22~1(t)N(,:)  + ^2522(t,t0)A22"1(t0) 

(2.51) 

x A22_1(t0)N(t0)  - p2A22'2(t)N(t) 

+ H2§22(t,t0)A22‘2(t0)N(t0)  + 0(H3) 

for  ii  sufficiently  small. 

Proof  of  Lemma  2.6.  The  integral  1^  may  be  expressed 
t 

h ■ J 122<t-T>';A22<T>‘’A22'1<T>,,<T>dT  • 

co 

Letting 

dw  = ’ u 522^C’ T)A22^T^dT 

and 

v =>  - ^20  i(T)N(T) 


f] 


j 


1 

<1 
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implies 

v * §22(t,T) 

and 

dv  = - pA22  (T)N(T)dT  - \iA22~  (T)N(T)dT 

which  yields 

t t . 

X1  = ’ ^22(t,T^A22  1<T)N(T)  i + ^ J ^22(t,T^A22  <T)N(T)d'r 

T=t0  C0 

t 

+ uj  §22(t,T)A22'1(T)N(T)dT  . 

C0 

Noting  that  for  p.  sufficiently  small  system  (2.4)  is  uniformly 
asymptotically  stable  by  Lemma  2.3  and  integrating  these  two  integrals 
by  parts  yields 

, t 

I.  - - V^9,(t,T)A  „ (T)N(T)  | 

1 22  i-t0 

t 

- g.2§-.(t,T)A„'l(T)A’1(T)N(T)  j 

T=tQ 

t . 

- U2§22(t,T)A22  (T)N(T)|  + OU  ) 

T=t0 

which  yields  (2.51)  and  Lemma  2.6  is  proved. 

Lemma  2.7.  Under  (i)  and  (ii)  the  integral  I2  may  be 
expanded  as 


r 


1 
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I2  = MN(t)A22"1(t)$22(t,t0)  - MN(t0)A22'1(tQ) 

- y.2N(t)A22'2(t)§22(t,t0)  + p.2N(t0)A22"1(t0) 

(2.52) 

- ti2N(t)A22  1(t)A22"1(t)§22(t,t0) 

+ p,2N(tQ)A22  1(t0)A22"1(to)  + °(^3) 
for  p.  sufficiently  small. 

The  proof  of  Lemma  2.7  follows  analogously  to  that  of  Lemma  2.6. 


i 


mum 
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rr 


3.  TIME-OPTIMAL  CONTROL  OF  TIME-VARYING  SYSTEMS 

3.1  Introduction  and  Problem  Statement 

In  this  chapter  we  treat  the  time-optimal  control  of  system  (2.1) 
which  for  convenience  is  rewritten  here 


x = An(t)x  + A12(t)z  + B1(t)u 


pi  = A21(t)x  + A22(t)z  + B2(t)u 


(3.1) 


where  x £ Rn,  z € Rm,  u € Rr  and  p is  a scalar  which  is  greater  than  zero. 
For  all  t,  the  matrices  A„(t),  i,j  = 1,2,  are  bounded  with  bounded 
derivatives  and  the  eigenvalues  of  of  A22(t)  satisfy 


Re(V  (t))  < -Y  < 0 . 


The  main  results  of  this  chapter  are  the  two  time-scale  property  and  the 
development  of  a near  time-optimal  control  for  (3.1).  The  necessary 
conditions  of  the  minimum  principle  are  applied  to  demonstrate  the  two 
time-scale  property,  that  the  time-optimal  control  of  (3.1)  is  composed 
of  an  initial  interval  of  control  in  a slow  time-scale  followed  by  an 
interval  of  control  in  a fast  time-scale.  Based  on  this  property  a near 
optimal  control  is  proposed  which  is  made  of  a time-optimal  control  of  the 
reduced  order  system 


2.  -1  - -i  - 

x — (A^^  *"  a^2A22  A21^x  (®i  " A12^22 

£ Aq (t)x  + Bq (t)u 


followed  by  a time-optimal  control  of  the  fast  subsystem 


(3.2) 


pz  * A21(t)xJ.  + A22(t)z  + B2(t)u  . 


(3.3) 


Lli 


37 


The  advantage  of  this  near  optimal  control  is  that  it  requires  control 

computations  only  on  reduced  order  systems. 

The  time-optimal  control  problem  treated  here  is  that  of  finding 
★ 

the  control  u (t)  constrained 

u € U - [u:  |uj  < 1,  i * 1, • • • ,r } 

9 

which  steers  the  state  (x',z 'Y  of  (3.1)  from  the  fixed  initial  state 

(Xq',  zq/)/  at  time  tg  to  the  fixed  final  state  (x^, ',  z^/)''  in  minimum 

“fc  *k  “k 

time  T . For  notational  purposes  t^  is  defined  t^  = T + tg.  System  (3.1) 

is  assumed  to  satisfy  the  following  condition  in  order  to  guarantee  the 

k 

existence  of  the  time-optimal  u (t). 

(i)  There  exist  t^  and  p.g  such  that  for  all  p.  fc  (0,pgj 

there  exists  a control  u(t,p.)  fcu  which  steers  (x  /,  z') 
from  (Xg zq/^/  t0  zp/ y within  time  t^-tg. 

According  to  Theorem  17  of  [35,  p.  127],  (i)  guarantees  the  existence  of 

k r -> 

u (t)  for  p.  fc  (0,PgJ.  The  normality  of  the  control  problem  is  assumed. 

3.2  Necessary  Conditions  from  the  Minimum  Principle 


In  this  section  the  minimum  principle  is  applied  to  the  control 
problem.  Then  a diagonalization  transformation  and  expansions  in  u.  are 
used  in  the  necessary  conditions  to  reveal  the  two  time-scale  property. 
The  Hamiltonian  to  be  minimized  is 


H - 1 + p'Au*  + p'Au2  + p'B^ 


+ t + A ,/A2J*  + 1 ’V 


(3.4) 


— — - 
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where 


• , 1 
p = - A, /p  - - A. 


'll  ' p "21  q 


“ ^12/^5  ” ii  q 


P 22 


(3.5) 


and 


u*(t)  = - SGnCb^p  + -i  B2'q] 


(3.6) 


The  normality  of  the  control  implies  that 


B1/(t)p(t)  + -i  B2/(t)q(t)  £ S (t)  = 0 

only  at  isolated  times  t^  and  not  on  a finite  interval.  Thus  the 
switchings  of  the  components  of  the  optimal  control  will  occur  at  the 
instants  t^  at  which  one  of  the  components  s^  i=  l,...,r  of  S satisfies 

si(t.)  = 0 . 

The  instants  t^  are  referred  to  as  the  zeroes  of  S. 

If  we  let  §(t,tQ)  be  the  state  transition  matrix  of  (3.1)  satisfying 


at 


’ A11 

A12 

- P A21 

±A, 
P • 

(3.7) 


then  (3.6)  may  be  written 


* 

u (t) 


» 

. I 

• PF  ■ 

- 

= - SGN 

[Y:  Y] 

. / * 

S-  (tf,t) 

. 

• • • 

1 

- 

*-p  qF° 

(3.8) 


L 7? 

where  pF  and  - qF  are  the  values  of  p(t)  and  q(t)  at  tf.  A candidate 

for  the  optimal  control  could  be  found  by  determining  t,,  p and  q such 

r F 
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that  u (t)  steers  (x',z'')/  from  (Xq/,Zq/)/  to  (xF',Zp/)/.  This  is  in 
general  a difficult  problem  due  to  the  stiffness  which  results  from  the 
presence  of  p. 

The  minimum  principle  provides  the  additional  necessary  condition 


H(tf)  = 0 


which  implies  that  as  p -*  0,  — a remains  bounded  and  therefore  q is 

P F r 

0(p). 

In  order  to  reveal  the  two  time-scale  property  the  diagonalization 
transformation  (2.41)  is  applied  to  (3.1)  and  (3.5)  to  yield 


X = (Au  - FA21)X  + GC  + (Bj_  - FB2)u 


PC  = A2]_X  + (A22  + pA21F)  C + B2u 


(3.9) 


and 


I - - (An  - fa21)'5  - ± An-H 

T,  _ _ G'g  - i (Ajj  + PA21F)'-| 


(3.10) 


where 


G'(t)  = A ' - A-»/F/  - pF'  + pTa.  / - yF'A  'F'  . 


(3.11) 


In  Section  2.4  it  is  shown  that  for  p sufficiently  small  and  t € [t^.t^J 
a solution  of 


is 


0(f)'  = 0 


F(t)'  = A 22/’1A21/  + PW(t,p)' 


(3.12) 
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„hetew  t.ct.ri  is  bo-dad.  Substitution  of  (3.12)  into  (3.9)  and  (3-iO) 


yields 


i - Atx  + <B0  - >*ra2)“ 
;C  - a21x  + a2C  + V 

5-- VS-W1 

W.TI  - - A-'H  . 


(3.13) 


(3.14) 


where  ^ £ AQ  - pWA2L,  A2  £ A22  + ^2]A12A22  + ^ A2lW  and  AQ  and  BQ 
are  as  defined  in  (3.2).  Letting  and  be  the  state 
transition  matrices  satisfying 


~ $22(t,t0)  = A22(t)*22(t’t0) 


(3.15) 


(3.16) 


yields  the  approximate  solution  to  (3.14) 

5(0  * *0/(tf,t)?*  - C§0/(t*,t)A2i/(t*)A2/(tf) 


- A21'(t)A2'(t)’H2'(t*,t)  + 0(O]\ 


1(0  - 522/(tf,t)?T’  + °(^ 


(3.17) 


(3.18) 


where  5^,  and  are 
and  (3.12)  imply 


, c - n t*.  Transformation  (2.41) 

the  values  of  and  at  cf. 


p(t)  « l(t) 


(3.19) 


q(t)  = 'H(C)  ' uA22'(t)  A12 


,(t)'1A1/(t)p(t)  + 0(vO 


t 
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5f  Pf 

flj.  = qF  + ^22^  ^12/^tf^F  + ^ 


(3.20) 


The  substitution  of  (3.17),  (3.18)  and  (3.20)  into  (3.19)  yields 


p(t)  = §0/(t*,t)(pf,  - A21'(t*)A2/(t*)"1qF) 


+ A21/(t)A2^(t)'1$22,(t*,t)qF  + 0(n) 


(3.21) 


q(t)  = ^22/(t*,t)(qF  + uA22/(t*)"1A12/(t*)pF) 


- ^22^(t)"1A12'(t)$0,(tJ,t)(pF  - A21/(t*)A2^(t*)’1qF) 

- MA22/(t)'1A12'(t)A21/(t)A2/(t)“1§22/(t*,t)qF 


(3.22) 


+ 0(H  ) . 


Finally  substituting  (3.21)  and  (3.22)  into  u (t),  (3.6),  yields 


u~(t)  = - SGN[B0/(t)20/(t^,t)(pF  - A21/(t*)A2,(t*)'1qF) 


+ BQ/(t)A21/ (t)A2r(t)  §22/(tf,t)qF 


(3.23) 


+ V 


qf  + A22/^tf^  A12^Cf^PF 


+ 0(h)]  • 


Lemma  2.1  implies  that  for  H small  enough  ^22^C,t0^  is  bounded  by  a 
decaying  exponential  with  0(H)  time  constant.  This  implies  that 
f20/(t^,t)  is  exponentially  decaying  in  reverse  time. 

* 

Lemma  3.1.  The  time-optimal  control  u (t)  is  composed  of  an  initial 
interval  of  control  in  a slow  time-scale  followed  by  an  interval  of  control 


!■ 


42 


in  a fast  time- scale.  In  particular,  for  any  e > 0 there  exists  p and  t 
such  that  for  p € [0,p],  u (t)  satisfies 


-SGN[B0/(t)$0'(t*,t)pF  + 0(p)  + 0(e)] 


t € Ct0,tf-T) 


u*(t)  -SGNCB0/(t)pF  + B2'(t)l22y(t*,t) 


(3.24) 


* <i  iF  + + 0<ii)] 

t € 


where  t = 0(  p) . 


Proof  of  Lemma  3.1.  By  Lemma  2.1  there  exists,  for  p small  enough, 
a constant  K such  that 


«22'(tf,t)l  < Ke 


t-t, 


for  t < t, 


£T 

Letting  Ke  ^ = e implies  T = - ■£•  ln(^)  where  e is  chosen  such  that 
e/K  < 1.  Thus  T = 0(p)  and  for  t < t*-T, 


(3.25) 


Recalling  that  * 0(p)  and  noting  that  t = 0(p)  implies  that 

= I + 0(p)  t € [t*-T,t*] 

and  hence  we  have  (3.24).  For  small  enough  e and  p,  the  switchings  on  the 

r * n * 

slow  interval  It^.t^-TJ  are  primarily  dependent  on  ^'(t^t)  and  the 
switchings  on  the  fast  interval  [t^-T, t*]  of  length  0(p),  are  primarily 

•ft 

dependent  on  B^p^  and  §22/(t^,t).  Thus  the  switchings  on  the  initial 
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interval  are  in  a slow  time-scale  and  the  switchings  on  the  final 
interval  are  in  a fast  time-scale  and  Lemma  3.1  is  proved. 

* 

Since  T is  0(|J.),  x(t^-T)  is  0 ( p.)  from  x^,.  Thus  the  two  time-scale 
property  implies  that  the  control  on  the  slow  control  interval  steers  x 
to  0(|i)  from  Xp.  The  fast  switchings  in  lt^-T, t^J  steer  z to  z^,  and  x 
the  last  O(p-)  distance  to  x^,.  Based  on  this  argument  the  near-optimal 
control  is  proposed  in  the  next  section. 

3.3  Slow  and  Fast  Control 

The  reduced  control  problem  is  that  of  finding  the  control  u (t)  € U 
which  steers  the  state  x of  the  reduced  order  system  (3.2)  from  x^  at  time 

— —‘A’  ■■  tV 

tn  to  x^  in  minimum  time  T . For  notational  purposes  tf  = T + t^.  Since 
U j?  I u 

setting  p.  to  0 in  (3.1)  to  yield  (3.2)  implies  that  z may  be  steered 
instantaneously,  the  reduced  problem  does  not  include  the  steering  of  z. 

The  minimum  principle  yields  the  following  necessary  conditions  for 
the  reduced  control  problem.  The  Hamiltonian  to  be  minimized  for  u € U 
is 


H = 1 + P^qX  + p'BpU 


(3.26) 


where  p satisfies 


P * - AQ(t)p 


(3.27) 


and 


u (t)  = - SGN[B()/p(t)] 


(3.28) 


The  solution  of  (3.27)  is 


P(t)  = - #0'(tf,t)pF 


(3.29) 
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which  when  substituted  into  (3.28)  yields 

^*(t)  - - SGN[B0'§0'(t^,t)pF]  . (3.30) 

We  let  x(t),z(t)  be  the  result  of  applying  u (t)  to  the  full  order  system 

(3.1)  and  x (t)  be  the  optimal  trajectory  for  the  reduced  order  system 

(3.2) . 

■ I'Jlf 

Lemma  3.2.  Let  u (t)  be  applied  to  (3.1).  Then 

x(t)  = x (t)  + 0(h)  (3.31) 

for  t c Ltg,T  J. 

Proof  of  Lemma  3.2.  Let  e(t)  = X(t)  - x(t),  then 
e = Ag(t)e  - |jWA2^(t)X  - 

where  it  is  assumed  that  e(t Q)  = 0.  The  error  e(t)  satisfies 
t 

e(t)  = - M<  J i0(t,T)W(Tfy.)CA21(T)x(T)  + B2(T)u(T)]dT  . 
c0 

On  [t0,^]x(t),  A21(t)  and  B2(t)  are  bounded  and  u € U.  Therefore 
e(t)  = 0(n).  Lemma  2.1  implies  that  there  exists  p.  such  that  for 
H € (0,h)  z(t)  is  bounded  on  [t^.tf]  and  thus  transformation  (2.41) 
implies 

x(t)  * X(t)  + 0(p.)  • 

Finally  this  implies  that 

a mf9c 

x(t)  = x + 0(h) 


and  Lemma  3.2  is  proved. 
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Ths  slow  stata  control  problem  Is  that  of  finding  the  control 

k 

U*(t)  € U which  steers  x of  (3.1)  from  xQ  to  Xj.  in  minimum  time  Tfl.  For 
this  problem  the  initial  value  of  z is  zQ  and  its  final  value  is  free 
which  implies  that  qp  = 0.  Thus  equation  (3.24)  implies 


-SGN[B0'$0'(tf>t)pF  + 0(H)  + 0(e)] 

t 6 Ct0>t*-x] 

-SGNCBq' Pf  + B2,§22/(t*,t)A22'(t*)'1A12/(t*)pF 
+ 0(H)]  b € Ct^-Tjtj] 


(3.32) 


and  u*(t)  also  possesses  the  two  time-scale  property.  On  the  slow 
s 

interval  [t«,t*-T)  the  switchings  of  u (t)  are  primarily  dependent  on 
Of  s 

B 'i  '(t*,t)p  . As  H-0,  t*  - t*  and  by  (3.31)  x(t)  -x*(t).  Therefore 
0 0 ' f F t 

as  H - 0,  PF  - PF* 

Lemma  g.q.  Let  the  zeroes  of  B0'(t)*0'(tf ,t)pp  of  the  reduced  order 
control  (3.30)  be  simple.  Then  for  small  enough  y the  switching  times  of 
u*(t)  on  [tQ,t*-T)  are  0(y3)  different  from  those  of  u*(t) . Furthermore 
the  minimum  time  Tg  for  the  slow  control  problem  satisfied 

T*=T*+0(ya)  (3-33) 

s 


where  a > 0 is  constant. 

Proof  of  Lemma  3.3.  The  fact  that  the  zeroes  of  BQ' (t)§Q/(tf ,t)pp 
are  simple  implies  that  the  control  sequence  of  u (t)  is,  for  y small 

k r * 

enough,  the  same  as  the  control  sequence  of  ug(t)  on  LtQ,tf-T).  The 
choice  of  the  parameter  t is  dependent  on  Che  reverse  time  exponential 
decay  of  ^22/(t*,t)  and  hence  satisfies 
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lim  e = 0 . 

p “*  0 

The  0(e)  and  0(p)  terms  of  (3.32)  shift  the  switching  times  of  "a* (t)  by 
some  p-dependent  variations.  Since  these  variations  go  to  zero  as  u - 0 
there  exists  b > 0 such  that  these  variations  are  O(p^).  For  a finite 

■■  */c 

number  of  switching  instants  t^  of  u (t) 

Ts  = T + 0(p  ) + T 


where  T = 0(p).  Thus  there  exists  a > 0 such  that 


* — * , a 

Tg  = T + 0(p  ) 


and  Lemma  3.3  is  proved. 

Lemma  3.3  suggests  that  by  varying  the  switchings  of  u (t)  by  some 
method  such  as  an  iterative  method  [6]  or  switching  sensitivities  [22] 


and  adding  some  fast  switchings  on  the  T interval  the  control  ug(t)  may  he 
found.  Thus  u (t)  is  a near-optimal  control  for  the  slow  control  problem 
in  the  sense  that  it  steers  x to  0(p)  from  x^,  and  requires  the  near-optimal 
time  T . 

— -k  — * 

The  point  z(t^)  of  the  fast  state  z at  t^  after  the  application  of 

-jy 

u (t)  to  the  full  order  system  will  be  some  finite  distance  from  z for 
• F 

p small  enough,  by  the  stability  Lemma  2.1.  Thus  the  time-optimal  control 

“if  ii 

u^(t)  which  steers  the  state  of  (3.1)  from  z(t^)  to  z will  require  0(p) 
time.  For  any  bounded  control  on  an  0(p)  interval  the  slow  state  x will 
only  be  moved  an  0(p)  distance. 

Let  u^(t)  be  the  control  which  time-optimally  steers  z of  the  fast 
subsystem  (3.3),  where  x^,  is  the  final  slow  state,  from  z(t^)  to  z^  in 
time  T^. 


A 


Lemma  3.4.  The  control 


u*(t) 


— * 

u (t) 

5j(t) 


t ^ [t0,tf) 

t € CtJ.tJ  + fj] 


(3.34) 


is  a near  time-optimal  control  for  (3.1)  in  the  sense  that  it  steers 
(x',z')‘  to  a point  which  is  0(h)  from  • (x^.',  ')'•  Furthermore 

T*  = T*  + 0(H)  + 0(p,a)  • (3.35) 


Proof  of  Lemma  3.4.  As  discussed  above  = 0(H)  and  therefore 
x(cf  + Tf)  = Xj,  + 0(h)  • 

Since  x will  be  0(h)  from  x^,  on  [t^t^  + T^),  u£(c)  will  steer  z from 
— * 

z(t-)  to  0(p.)  from  z_.  The  control  u (t)  requires  time  T + T_  and 
t r F 

therefore  by  equation  (3.33) 

T*  = T*  + 0(h)  + 0(Ha)  • 

ic 

The  time  T for  time-optimally  steering  both  x and  z must  be  greater  than 
* 

Ts«  Since  it  is  possible  to  steer  the  fast  state  in  0(h)  time,  equation 
(3.35)  is  satisfied  and  Lemma  3.4  is  proved. 

As  pointed  out  in  the  introduction,  Section  3.1,  the  near  optimal 

control  is  computed  as  the  time-optimal  controls  for  two  reduced  order 

systems  (3.2)  and  (3.3).  As  developed  here,  this  control  requires  the 
■ ★ 

knowledge  of  z(t-)  which  may  be  found  from  integration  of  system  (3.1) 

— *■ 


with  the  control  u (t).  Singular  perturbation  techniques  are  well 
developed  for  such  differential  equations  and  imply  that  for  h small  enough 


A 
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* 


z(tf)  = z(tf)  + 0(p.) 


(3.36) 


where 


z = - A22  1A21x  - A22  1B2u  . 


(3.37) 


— — it  — ★ 

Thus  the  use  of  z(t^)  instead  of  z(t^)  will  introduce  another  O(^) 
error  in  z when  it  is  not  possible  to  compute  z(t^). 

The  following  lemma  makes  it  possible  to  replace  the  assumption  of 
the  normality  of  the  time-optimal  control  for  the  full  order  system  (3.1) 
by  the  assumption  of  the  normality  of  the  time-optimal  controls  for  the 
reduced  order  system  (3.2)  and  the  fast  system  (3.3). 


Lemma  3.5.  Assume  that  the  reduced  order  control  u (t)  and  the  fast 

~ic  it 

control  u^(t)  are  normal.  Then  for  small  enough  p the  control  u (t)  is 
normal . 


Proof  of  Lemma  3.5.  The  minimum  principle  implies  that 


a*(t)  = - SGN[i  B2'(t)$22'(t^  + T^,t)qF]  . 


(3.38) 


L 


Let  F(t)  = V(t)§2/(tf  + Tf’C)  and  S(t)  = B0/(t)§^(tf,t).  The 
normality  assumption  on  u (t)  and  u^(t)  implies  that  the  components  f^(t) 
and  s^(t),  i = l,...,r  of  F(t)  and  S(t)  satisfy 


and 


ft(t)  * 0 


si(t)  = 0 


at  isolated  instants  of  time  t ^ . By  equation  (3.24)  for  u (t)  the 

— ★ * r * 

normality  of  u (t)  implies  the  normality  of  u (t)  for  t 'c  it^.t^  - t)  and 
the  normality  of  u (t)  implies  the  normality  of  u (t)  for  t c It  - T,t.J 


and  therefore  Lemma  3.5  is  proved. 
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4.  THE  INTERMEDIATE  POINT  ALGORITHM 

4.1  Introduction 

This  chapter  presents  the  intermediate  point  algorithm  which  exploits 
the  two  time-scale  property  developed  in  the  last  chapter.  As  pointed  out 
there,  the  time-optimal  control  can  be  viewed  as  being  made  of  two  parts. 
The  first  part  is  primarily  concerned  with  the  control  of  slow  states 
while  the  second  part  is  primarily  concerned  with  the  control  of  fast 
states.  Corresponding  to  the  two  parts  of  the  control,  the  optimal 
trajectory  is  divided  into  two  parts.  The  point  lying  at  the  intersection 
of  the  slow  and  fast  part  of  the  optimal  trajectory  is  called  the  inter- 
mediate point.  The  intermediate  point  algorithm  consists  of  iterations 
for  the  intermediate  point  based  on  computations  for  lower  order  systems. 
From  the  intermediate  point  the  slow  and  fast  parts  of  the  control  are 
calculated. 

The  system  for  which  the  algorithm  is  developed  is 

x = A^x  + B^u  (4.1a) 

- A2z  + B2u  (4.1b) 

where  x € Rn,  z € Rm,  u € Rr  and  p.  > 0.  In  [3]  a transformation  is 
presented  which  transforms  linear  time-invariant  singularly  perturbed 
systems  to  the  form  of  (4.1).  The  eigenvalues  of  A^  satisfy 

Re(^)  < 0 , (4.2) 

u is  constrained 

u € U * {u:  |u^  | < 1,  i * 1 rj  (4.3) 
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and  (4.1)  is  completely  controllable  from  each  component  u^  of  control. 

This  controllability  implies  the  normality  of  the  problem  treated  here 

which  is  that  of  finding  the  control  u*(t)  which  steers  (x/,z/)/  of  (4.1) 

★ 

from  (Xq*',  ZqO'  at  t^  * 0 to  (x^/,  zj//  in  minimum  time  T . For  small 
(4.1a)  is  referred  to  as  the  slow  subsystem,  (4.1b)  as  the  fast  subsystem 
and  x and  z as  the  corresponding  slow  and  fast  states. 

Systems  (4.1a)  and  (4.1b)  are  equivalent  to  the  systems  referred  to 
as  the  reduced  order  and  fast  subsystems  in  the  last  chapter.  Conse- 
quently Lemma  3.4  implies  that  a near-optimal  control  is 

Ug(t)  t € [o,T3) 

u(t)  * A (4.4) 

Gf(t)  t € [Ts,T) 

/s 

where  ug(t)  time-optimally  steers  x from  x^  to  x^,  and  u^(t)  time-optimally 

steers  z from  z(Ta)  to  z^,.  This  control  is  near  optimal  in  the  sense  that 

* * 

it  steers  z to  z^.  and  x to  0(|i)  from  x^,  and  T = T + 0(n). 

4.2  The  Intermediate  Point  Algorithm 


Since  every  part  of  a time-optimal  trajectory  for  (4.1)  is  an  optimal 

★ 

trajectory,  the  time-optimal  control  u (t)  can  be  viewed  as  a concaten- 
ation of  two  optimal  controls 

*;t)  t € [0, tx) 

(4.5) 


* 

u (t) 


u*(t)  t 6 [tT,T~) 


where  t^  is  some  intermediate  time  when  the  state  x is  at  an  intermediate 

r 

point  x(t^).  The  interval  Lt^,T  J is  expected  to  be  considerably  shorter 

J. 

than  [0,0.  The  controls  a (t)  and  u,(t)  are  time-optimal  to  and  from 
I st 

ths  intermediate  point.  We  denote  the  slow  and  fast  parts  of  the  inter- 
mediate by  x^  and  z^,  that  is,  x(t.)  * x^  and  z(t^)  = z^. 


J 


51 


The  name  of  the  intermediate  point  algorithm  comes  from  the  fact 
that  this  algorithm  iterates  for  an  intermediate  point,  which  lies  on 
the  optimal  trajectory.  In  each  iteration  the  nth  iterate  for  the  inter 
mediate  point  is  denoted  by  (x^.z^11)  and  the  corresponding  iterate  for 
the  intermediate  time  is  denoted  t^n. 

The  first  guess  for  the  intermediate  point  is  provided  by  the  near 
optimal  control  (4.4).  The  intermediate  time  is  defined 


The  slow  intermediate  point  is  defined 


XI  =XF 


The  controls  u (t)  and  u,(t)  are  thus  first  guesses  u ^(t),  u,^(t)  for 
Si  SI 

u (t)  and  u,(t)  respectively.  As  the  next  guess  x ^ for  x , we  define 


the  point  which  will  be  steered  to  x^,  by  uf  (t).  In  general,  if  we  have 
an  nth  guess  ufn(t)  for  u*(t),  then  x^*1  which  is  transferred  to  Xp,  by 
ufn(t)  is 


n+1 


e *F  - J 


Tn  A1(tin-T) 


n 


B1uf  (T)dT 


(4.6) 


where  T°  - t^n  is  the  minimum  time  in  which  u^n(t)  steers  z from  z^n  to 
Zp,.  On  the  other  hand  z^."  is  the  point  to  which  usn(t)  steers  z to  z^, 
while  optimally  steering  the  slow  state  x from  Xq  to  x^n  in  time  t^”, 
that  is, 


B2Us‘(T)dT  . 


,tTn\ 

Cl 

rt 

1 

P 

t 

* 

C T \ 

i 

1 V- 1 

.If* 

+ ~ J e 

» ) 

= e 


(4.7) 
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Expressions  (4.6)  and  (4.7)  for  x^1*  and  z n and  the  definitions  of 
* n 

u (t)  and  u * (t)  constitute  the  intermediate  point  algorithm.  The  nth 

S I 

iteration  consists  of  the  following  four  steps: 


Step  1.  Find  u (t)  which  steers  x of  (4.1a)  from  x to 

S U 

x^n  in  minimum  time  t^  . 

Step  2.  Evaluate  z ^ from  (4.7). 

Step  3.  Find  u^n(t)  which  steers  z of  (4.1b)  from  z^n  to 

z-  in  minimum  time  Tn  - t_n. 
r I 

Step  4.  Knowing  u^n(t),  Tn  and  t^n  from  the  n-1  step,  evaluate 
xTCV+’1  from  (4.6). 


The  procedure  initialized  with  x 


I -v 


It  terminates  when 


l n+1  a _ 

l*j  " xj  K « 


(4.8) 


where  e is  a preassigned  scalar  such  as  s = 10~^.  Then  u^n(t)  will  steer 


(x',z')'  from  xzn  .z^  )'  to  (xp/,zF/) 


. Thus  the  control 


u°(t) 


ugn(t)  t € [O.t^) 


ufn(t)  t € Ct^.T*1] 


(4.9) 


will  steer  (x',zy)/  from  (Xq',zq')'  to  (x_/, zJ./)/,«  A test  for  the 
optimality  of  this  control  is  given  in  Lemma  4.1. 

The  convergence  properties  of  this  algorithm  have  not  been  treated 
with  any  rigor.  What  is  known  is  that  for  a number  of  examples  which  are 
presented  in  [5]  the  convergence  was  quite  rapid,  sometimes  in  less  than 
ten  iterations.  The  idea  which  led  to  the  development  of  the  algorithm 
which  may  contain  the  seed  for  a convergence  proof  is  simple.  For  u 


small  enough  the  last  interval  of  constant  control  of  ug  (t)  will  be  long 
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enough  so  that  z^.n  will  be  arbitrarily  close  to  its  equilibrium  value  at 
the  end  of  the  interval.  In  this  case  making  an  0(p)  change  in  x^11  to 
yield  xjn+^  will  not  appreciably  change  z^n  and  therefore  u^n(t)  will 
change  only  a small  amount.  The  condition  that  remains  to  be  discovered 
is  that  under  which  the  small  changes  in  u^n(t)  are  in  the  proper 
direction  for  convergence. 

The  computational  efficiency  of  the  algorithm  depends  on  Steps  1 
and  3 which  require  the  calculation  of  time  optimal  controls  for  the 
reduced  order  systems  (4.1a)  and  (4.1b).  For  second  order  subsystems 
phase  plane  techniques  can  be  used.  Thus  for  fourth  order  systems  with 
second  order  subsystems  the  algorithm  can  be  easily  implemented.  For 
higher  order  systems  one  of  two  approaches  can  be  used.  The  first  is  to 
apply  other  computational  methods  for  time-optimal  control  such  as  those 
in  [15,22].  The  other  approach  is  to  see  if  the  eigenvalues  of  the  system 
are  such  that  the  system  may  be  broken  down  into  more  than  two  subsystems. 
The  advantage  of  the  intermediate  point  algorithm  is  that  for  systems  with 
smaller  p.  the  convergence  should  be  better.  Thus  the  order  of  computations 
that  are  needed  are  reduced  and  the  difficulties  characteristic  of  stiff 
systems  are  removed  since  the  stiff er  the  systems,  the  faster  the  con- 
vergence. This  indeed  is  the  reason  for  the  development  of  singular 
perturbation  methods  for  control  theory. 

As  in  most  optimization  procedures,  a control  to  which  the  procedure 
has  converged  and  the  corresponding  final  time  T,  must  be  tested  for 
optimality.  When  this  algorithm  has  converged  the  control  steers  the 
state  of  (4.1)  from  the  initial  state  to  the  final  state  to  within  the 
desired  accuracy.  The  next  step  is  to  see  if  the  necessary  conditions  of 
the  maximum  principle  are  satisfied.  Let  the  rows  of  and  be 
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f\\f 

denoted  by  and  b^  > j = l,...,m.  Then  from  equation  (3.6)  the 

"Jc 

components  of  u (t)  are 

u*(t)  = - sgnCb^^  p(t)  + ^ b2(j)/q(t)]  . (4.10) 

As  the  kth  switching  instant  t^  of  any  component  u^(t) 


b1(j)/P(t.k)  +^b2(j)'q(t.k)  = 0 . 


(4.11) 


This  implies 


. ★ ^ 


(4.12) 


As  the  final  time  the  Hamiltonian  satisfies 


H(T  ) 


which  implies 


K ■ 1 + VVf  + PfV  + £ Wf  + \ W ‘ 0 


(4.13) 


These  are  the  conditions  of  the  minimum  principle  that  remain  to  be 
satisfied  which  leads  to  the  following  conclusion. 


Lemma  4.1.  Suppose  the  p and  q can  be  found  such  that  (4.13)  is 
satisfied  and  such  that  for  each  switching  instant  t^n  of  each  component 
of  control,  (4.12)  is  satisfied  for  the  switching  instants  and  final  time 
of  the  control  computed  by  the  intermediate  point  algorithm.  Then  the. 
control  computed  from  the  intermediate  point  algorithm  satisfies  the 
necessary  conditions  of  the  minimum  principle  and  is  therefore  a candidate 
for  the  optimal  control. 

From  an  engineering  standpoint  we  may  be  satisfied  if  the  control 
ufn(t)  requires  a time  considerably  shorter  than  u n(t),  since,  as  was 


- B am  -■ 
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discussed  in  the  last  chapter  u ^(t)  should  require  a time  0(y.)  different 

s 

from  the  minimum  time  needed  to  steer  x from  x^  to  x^,.  Thus  if  with  an 

additional  short  interval  of  control  added,  x and  z can  both  be  steered 

to  their  final  values  (x^.z^,),  the  control  if  not  optimal,  may  satisfy 

our  needs  anyway.  In  this  statement  it  has  been  assumed  that  ugn(t) 

requires  approximately  the  same  amount  of  time  as  u ^(t)  since  x n 

S i 

should  be  0(p.)  from  x^,. 

4.3  Example 


System  (4.14)  represents  a generator  driving  a pump  which  pumps 
water  into  a reservoir.  Water  flows  from  this  reservoir  into  a second 
reservoir  at  a rate  proportional  to  x^,  a small  change  in  water  depth 
around  the  nominal  value  x^.  Water  also  flows  out  of  the  second  reservoir 
at  a rate  proportional  to  x^,  a small  change  in  water  depth  around  the 
nominal  depth  x^« 

The  reservoir  depths  x^  and  X2»  the  angular  velocity  z^  of  the  motor 
shaft  and  the  armature  current  z^  are  the  state  variables  chosen  to  model 
this  system. 
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(4.14) 


The  parameter  (J.  is  inversely  dependent  on  the  friction  in  the  motor  and 
pump.  The  control  u is  the  normalized  armature  voltage. 


Substituting  possible  values  into  (4.14)  yields 


— 

- 

- 

9 * 

" — 

*1 

-1.5  0 

0 

0 

X1 

0 

*2 

1.5  -2 

0 

0 

X2 

0 

3 

+ 

0 0 

-4.54 

.091 

21 

0 

*2 

0 0 

100 

-100 

*2 

100 

• — 

_ 

(4.15) 


The  initial  state  chosen  for  this  example  is 


- 

• * 

X10 

13 

X20 

6 

Z10 

5 

Z20 

45 

m m 

(4.16) 


The  problem  is  to  steer  the  state  of  this  system  [x^  x^'  from  the 

initial  state  to  the  origin  in  minimum  time.  In  order  to  accomplish  this 
a linear  transformation  is  applied  to  (4.15).  The  system  eigenvalues  are 
-1.5,  -2.0,  -4.448  and  -100.0952.  From  the  values  of  these  eigenvalues 
the  matrix  of  eigenvectors  is 


10  1 1 
31-  .6127  - .0152 

0 0 -2.9848  -98.595 

0 0 -2.9803  103,530 


(4.17) 


and  its  inverse  is 


1 

-3 

0 

0 


0 

1 

0 

0 


.33472  .309X10"3 

-1.2092  -.112X10"2 

- .33471  -.318X10-3  j 

-5  -5  * 

- .963X10  3 -.965X10  3 ! 


(4.18) 
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A normalization  matrix  is 


+.02087 

0 

0 

0 


0 0 

-.05684  0 

0 -.007255 


0 
0 
0 

-.00000964 


(4.19) 


Applying  the  transformation  (x',z')  = PT(X',  £ )'  to  (4.15)  yields 


— * 

• 

• 

w - 

■*  — 

*1 

-1.5 

0 0 

0 

*1 

-1.5 

*2 

0 

-2.0  0 

0 

*2 

-2 

<3 

0 

0 -4.448 

0 

S 

-4.448 

^4 

0 

0 0 

-100.0952 

S 

-100.0952 

_ 

_ J 

- 

u (4.20) 


and  the  corresponding  transformed  initial  state 

-i 


-2.2168 


1 

0 

r-t 

X 

\ 

[ 

0 

04 

X 

11 

So 

1 

0 

.13455X10 

.64924X10 


-7 


(4.21) 


After  ten  iterations  of  the  intermediate  point  algorithm  the 
control 

-1  t € [0,  1.0058) 

+1  t € [1.005,  1.5829) 

-1  t € [1,5829,  1.7281) 

+1  t 6 [1.7281,  1.7350) 

was  found  which  steers  the  state  to  a final  point 


u *< 


(4.22) 


t i 


-.7X10 


-.9X10 


>.5X10 


-.2X10 


(4.23) 


This  is  equivalent  to  the  final  state 


1.46X10 


-5.1156X10 


1.082X10 


-1.99X10 


(4.24) 


This  example  demonstrates  the  use  of  a linear  transformation  in 
conjunction  with  the  intermediate  point  algorithm  to  find  the  time- 
optimal  control  for  a system  not  in  the  form  of  system  (4.1). 
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5.  TIME-OPTIMAL  CONTROL  OF  A CLASS  OF 
NONLINEAR  SYSTEMS 

5.1  Introduction  and  Problem  Statement 

In  this  chapter  the  problem  treated  is  that  of  finding  the  control 
u (t)  which  time-optimally  steers  the  state  (x zy)'  of  the  nonlinear 
system 

x = f(x,t)  + F(x,t)z  + B1(x,t)u  (5.1a) 

HZ  = g(x,t)  + G(x,t)z  + B2(x,t)u  (5.1b) 

from  an  initial  fixed  point  (Xq/,Zq/)/  at  time  t^  to  a final  fixed  point 
(Xp/, Zp,/)/»  Here  x € Rn,  z € R31,  p.  > 0 and  u ( Rr  is  constrained 

u € U = {u:  |u^|  < 1,  i - l,...,r}  . (5.2) 

The  vectors  f(x,t)  and  g(x,t)  and  the  matrices  F(x,t),  G(x,t),  B^(x,t) 

and  B2(x,t)  have  bounded  derivatives  with  respect  to  t and  x.  For  piece- 

wise  constant  controls  u(t)  € U with  a finite  number  of  discontinuities 

and  the  corresponding  trajectories  x(t)  and  z(t),  the  inverse 
- -1 

G(x(t),t)  exists.  Also  the  homogeneous  system 

pw  =*  G(x(t) , t)w  (5.3) 

has  w = 0 as  a uniformly  asymptotically  stable  equilibrium  as  discussed 
in  Section  2.5. 

The  existence  and  normality  of  the  time-optimal  control  of  system 
(5.1)  is  assumed. 

The  chapter  is  organized  as  follows.  In  Section  5.2  the  necessary 

"ft 

conditions  for  the  time-optimal  control  u (t)  of  (5.1)  are  stated.  From 


these  conditions  a (t)  is  shown  to  consist  of  two  parts.  The  first  part 
is  made  of  switchings  in  a "slow"  time- scale  while  the  second  part  is 
made  of  switchings  in  a "fast"  time-scale.  This  is  referred  to  as  a two 
time-scale  property.  The  reduced  order  system  is  defined  in  Section  5.3. 
From  the  time-optimal  control  of  the  reduced  order  system  a near  time- 
optimal  control  is  proposed  and  the  sense  in  which  it  is  near  optimal 
demonstrated.  An  example  is  presented  in  Section  5.4. 

5.2  Necessary  Conditions 

In  this  section  the  necessary  conditions  of  the  minimum  principle 
[36]  are  applied  to  the  time-optimal  control  of  (5.1).  The  Hamiltonian 
is 

H = 1 + p'f  + p'Fz  + p'BjU  + “ q'g  + “ q'Gz  + q'B2u  (5.4) 

where  the  costates  p and  q,  which  correspond  to  x and  z respectively, 
satisfy 

P - - fx'P  “ (Fz)x'p  - (B1u)x/p 

- i gx'  q “ i (Gz)x/q  “ i (B2u)x/q  » (5-5a) 

q ■ - P'p  - ” G/q  . (5.5b) 

Here  the  Jacobian  of  a vector  is  indicated  by  the  subscript.  The  control 
which  minimizes  the  Hamiltonian  is 

u*(t)  =•  - SGN[B1/p  + i B2'q]  . (5.6) 

The  normality  of  the  time-optimal  control  implies  that 
B1  /p  + B^^q  = S(x,z,t,  p.)  = 0 


I 


only  at  Isolated  times  t^  and  not  on  a finite  interval.  Thus  the 
switchings  of  the  optimal  control  will  be  determined  by  the  instants  t^ 
at  which  one  of  the  components  s^,  i = l,...,r  of  S satisfies 


si(x,z,tj,  p.)  =»  0 


The  instants  t^  are  referred  to  as  the  zeroes  of  S. 
For  the  sake  of  simplicity  let 


Au(x,z,t)  = fx(x,t)'  + (F(x,t)z)x/  + (B1(x,t)u)x/, 
A12(x,z,t)  = gx(x,t)'  + (G(x,t)z)x'  + (B2(x,t)u)x/ , 
A21(x,t)  = F(x,t)/ , 

A22(x,t)  = G(x, t)7  . 


(5.7) 


Substituting  (5.7)  into  (5.5)  yields 

1 . 

P = ' AUP  - Z A129 

11  ^ (5.8) 

q = - A21p  - i A22q  . 

We  wish  to  find  approximate  solutions  for  p and  q to  substitute  into  (5.6) 
to  reveal  the  two  time-scale  property.  Therefore  we  apply  the  trans- 
formation 


§ - P 

11  * i q + A22’1a21P 


(5.9) 


to  (5.8)  to  yield 


AQ(x,z,t)  ~ Aj^  - ^2^22  A21 

M(x,z,t)  = kn  A21  + A22  A21  - A22  AnAu 

+ a22  a21a12a22  a21 

and 

^2 (x>  z> t , = &22  ^22  ^21^12  * 

Consider  any  candidate  u(t)  for  the  optimal  control  and 
trajectory  x(t),z(t).  Substituting  u(t),  x(t)  and  z(t) 
and  A2  results  in  the  time- varying  system 

1 = - A0(t)S  - A12(t)Tl 

T)  = M(t)S  - £ A2(t)T]  , 

and  the  solution  to  (5.12)  is 

t 

5(t)  = *0'(tf,t)SF  - J l0/(T,t)A12(T)Tl(T)dT 

t 

Tl(t)  * §2/(tf,t)TlF  + J*  §2/(T,t)M(T)5(T)dT 

Cf 

where  §F  and  are  the  values  of  § and  T]  at  the  final 
state  transition  matrices  and  §2  satisfy 

h *o 

and 


(5.11a) 

(5.11b) 

(5.11c) 

the  corresponding 
into  Aq>  A12,  M 

(5.12) 

(5.13a) 

(5.13b) 

time  t^  and  the 

(5 . 14a) 


TT  $,'(t-,t)  = - A,(t)$  '(t  ,t)  . 


(5.14b) 
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Since  M(T)  and  §(t)  are  bounded  for  t € [t^.t^],  Lemma  2.6  implies  that 


T)(t)  = $2'(tf,t)T)f  + 0(H)  . 


Substituting  (5.15)  into  (5.13a)  yields 


(5.15) 


?(t)  = *0'(tf,t)Sp  - J (T,t)A12(T)#2/(tf>T)TlfdT 

Cf 

+ 0(h) 


(5.16) 


from  which,  by  Lemma  2.7,  we  find 


S(t)  - *0(t£,t)'Sp  + o(h)  . 


(5.17) 


Rather  than  derive  approximate  solutions  for  p and  q from  (5.9),  (5.17) 
and  (5.15)  it  is  simpler  to  apply  transformation  (5.9)  to  (5.6)  and 
substitute  (5.17)  and  (5.15)  into  the  result  to  yield 

u*(t)  - - SGHCBj/#0/(t*,t)§p  + B*'§2,(t*,t)T1F  + 0(H)]  (5.18) 


where  Bq/  =>  B^/  - B^ ^21  * the  * denotes  that  bq'  and  ^2  are 
evaluated  along  the  optimal  trajectory  and  t^  is  the  optimal  final  time. 
Analysis  of  (5.18)  yields  the  two  time-scale  property. 

★ 

Lemma  5.1.  The  time-optimal  control  u (t)  is  composed  of  an  initial 

interval  of  switchings  in  a slow  time-scale  followed  by  an  interval  of 

* 

switchings  in  a fast  time-scale.  Furthermore  u (t)  satisfies 


u*(t) 


-SGNCb*'  #0'(tJ,t)Sp  + 0(h)  + 0(e)]  t € [t0,t*-T) 
-SGN[b*'  ?p  + B*'  + 0(H)]  t € [t*-T,t*) 


(5.19) 


where  $ (t  t)  is  the  state  transition  matrix  of  a system  which  is 
0 

uniformly  asymptotically  stable  and  e may  be  chosen  arbitrarily  small. 
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Proof  of  Lemma  5.1.  The  proof  proceeds  as  follows.  First  it  is 
shown  that 

V^f’10  = V(tf,t:)  + 0(^  • (5*20) 

Then  based  on  (5.20)  the  two  time-scale  property  is  revealed. 

Consider  the  homogeneous  system 

ph  =*  - (A22  + tjA22"lA21A12)h  (5.21) 

with  final  state  at  time  t^.  The  solution  to  (5.21)  is 

h(t)  =■  V^’t^  * (5.22) 

Since  (5.3)  is  uniformly  asymptotically  stable  the  homogeneous  system 

W-y  = A22(t)y  = - G(x(t),t)y  (5.23) 

with  state  transition  matrix  § /(t,,t),  is  uniformly  asymptotically  stable 

P*  £ 

in  reverse  time.  In  terms  of  (5.22)  may  be  approximated 


h(t)  = */(tf»t)TV  + HW^.t)”^  . (5.24) 

Substituting  (5.22)  and  (5.24)  into  (5.18)  yields 

u*(t)  - - SGN[B*'$0'(t*,t)5F  + B/l^t^t/TTj,  + O(-p)]  . (5.25) 

For  any  s,  there  exists  a and  a t such  that  if  p £ (0,Pq] 

|$^'(tf,t)  | < e for  t £ [tQ,tf-T]  . (5.26) 

Furthermore  T = 0(p)  which  implies 

*0'(tfft)  » I + 0(p)  t £ [tf-T,tf]  . (5.27) 
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Equations  (5.25),  (5.26)  and  (5.27)  thus  imply  (5.19).  The  initial  or 
slow  control  interval  is  Lt^t^-T)  and  the  final  0(p)  interval,  called 
the  fast  control  interval  is  [t^-T,t^].  Since  t is  0(p.),  (5.19)  implies 
that  the  switchings  in  the  slow  control  interval  are  primarily  dependent 
on  BQ/$Q/(t^,t)5j,  for  p.  small  enough.  Similarly  in  the  fast  control 
interval  the  switchings  are  primarily  dependent  on 

V^F  + • <5-28> 

Lemma  5.1  is  proved. 

One  additional  fact  can  be  discerned  from  equation  (5.19)  for  the 

control.  After  the  completion  of  the  slow  control  interval  the  slow  state 

x will  be  within  0(p.)  of  x since  T is  0(p.).  This  agrees  with  our 

r 

intuition  since  in  a system  in  which  some  states  can  be  steered  much  more 
rapidly  than  others,  it  makes  sense  that  the  time-optimal  control  should 
first  concentrate  on  steering  the  slow  states  near  to  their  final  state 
and  then  steer  the  fast  states  rapidly  to  their  final  states  while  also 
steering  the  slow  states  the  last  small  distance.  Based  on  this  under- 
standing of  (5.19)  the  near-optimal  control  will  be  proposed  in  the  next 
section. 

5.3  Slow  and  Fast  Control 

The  reduced  order  system  is  defined  and  the  near-optimal  control  is 
proposed  in  this  section.  Setting  p.  to  zero  in  (5.16)  yields  the  reduced 
order  system 


. ..  . . — 


M 
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x =*  fQ(x,t)  + BQ(x,t)u 

■ f(x,t)  - F(x,t)G(x,t)_1g(x,t) 

+ [B-^x.t)  - F(x,t)G(x,t)  1B2(x,t)]u  . 


(5.29) 


The  reduced  order  time-optimal  problem  for  this  system  is  that  of  finding 
— ★ — 

the  control  u (t)  which  steers  the  state  x(t)  of  (5.29)  from  the  initial 
point  Xq  at  time  t^  to  the  final  point  x^,  in  minimum  time  T . 

The  minimum  principle  provides  the  following  necessary  conditions 
for  the  reduced  problem: 


H = 1 + p'fQ(x,t)  + P/Bq(x, t)u  , 


(5.30) 


P = - fn(x,t)'/p  - (B_(x,t)u)j' p , 


(5.31) 


u (t)  = - SGN[B0/(x,t)p(t)]  . 


(5.32) 


Along  an  optimal  trajectory  x (t)  system  (5.31)  is  written 


P * - AQ(t)p 


(5.33) 


which  has  the  solution 


p * *0'(tf,t)pp  . 


(5.34) 


Substituting  (5.34)  into  (5.32)  yields 


n (t)  « - SGN Cb . / ('x  .t')§../(t,.,t)p_] 
U ' * - u f ■ • r 


(5.35) 


—7* 

where  t^  is  the  optimal  final  time. 

★ 

We  consider  the  slow  control  problem  of  finding  the  control  us(t) 
which  steers  x of  (5.1)  from  x^  at  time  t^  to  x^,  in  minimum  time  Tg  and 
show  that  a (t)  is  a near-optimal  control  for  this  problem. 
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Lemma  5.2.  Let  x(t),z(t)  be  the  trajectory  corresponding  to  the 
application  of  u (t)  to  system  (5.1)  and  x (t)  be  the  optimal  trajectory 
for  the  reduced  order  system.  Then 


x(t)  = x (t)  + 0(p.)  . 


(5.36) 


Proof  of  Lemma  5.2.  Let  e(t)  = x(t)-x(t).  The  trajectory  x(t),z(t) 
satisfies 


x = f(x,t)  + F(x,t)z  + B^(x, t)u* 

• ^ ^ ^ 
pa  » g(x,t)  + G(x,t)z  + B2(x,t)u  . 


(5.37) 


Let  t^  be  the  instants  at  which  any  of  the  components  of  u (t)  switches. 
Then  on  any  interval  [t^, t^+^) , equation  (2.38)  implies  that 

z(t)  « §^(t,ti)[zi  + G(x(ti),ti)”1g(x(ti),ti) 

+ B2(x(ti),ti)H*(ti)]  - G(x,t)_1g(x,t)  (5.38) 

- G(x,t)_1B2(x,t)u*(t)  + 0(p.)  . 


Thus  on  [t.,t. , ,) 
i i+l' 

* A A J^- 

x = f0(x,t)  + Bq(x,  t)u  + 0(p.) 

+ F(x,t)$^(t,ti)W(ti) 

where 

W(tt)  = z£  + G(x(ti),ti)"1g(x(ti),ti) 
+ B,(x(t  ),t  )u* (t  ) . 


(5.39) 


(5.40) 


for 


Let  x (t)  be  the  point  for  all  t satisfying  the  Taylor  theorem 

til 


fn(x,t)  + Bn(x,t)u  = fn(x,t)  + Bn(x,t)u 
u u u u (5.41.: 

+ (f0(xa(t),t)  + B0(xa(t),t)u*)xe  , 

then 

e - (f0(xa(t).t)  + B0(xa(t),t)u*)xe  + 0(h) 

+ P(i,t)*tk(t,t1)W(ti)  (5.42] 

£ A(t)e  + 0(n)  + F(t)$^(t,ti)W(ti)  . 

The  solution  to  (5.42)  is 


e(t)  =■  §e(t,ti)e(ti)  + 0(p.) 
t 

+ J*  $e(t,T)F(T)$^(T,ti)W(ti)dT 
Ci 

and  thus  by  Lemma  2.2 

e(t)  * §e(t,ti)e(ti)  + 0(H)  • 


(5.43! 


(5.44: 


Since  e(tQ)  * 0,  e^)  =»  0(h)-  If  e(tt)  = 0(h),  then  e(ci+1)  = 0(h)  by 
(5.44)  and  thus  since  there  are  a finite  number  of  t 

e(t)  * 0(h)  t € [t„,t£] 

U 4. 


and  therefore 

x(t)  = x* (t)  + 0(h) 
and  Lemma  5.2  is  proved. 
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For  the  slow  control  of  (5.1)  the  final  value  is  free  and  there- 


fore q = 0.  Thus  transformation  (5.9)  implies  that 
F 


- PF 

Tip  = A22 


(5.45) 


where  A 22  ^ and  A21  are  eva^uate^  along  the  optimal  trajectory.  Sub- 
stituting (5.45)  into  (5.25)  yields 


u*(t)  = - SGN[B*"§0'(t*,t)pF 


+ B*/§^(t*,t)A22’1(t*)A21(t*)pF  + 0(H)] 


(5.46) 


which  implies  that  u (t)  also  possesses  the  two  time-scale  property.  On 

s 

an  initial  interval  [tQ,tg+Ts~T]  the  switchings  are  primarily  dependent 

on  the  term  Bft/$ft/(t.,t)p  . As  h goes  to  zero  the  optimal  trajectory 

u u r F 

^jf  5^  in 

xg(t)  corresponding  to  ug(t)  converges  to  x (t)  and  ug(t)  converges  to 

u*(t).  Therefore  B*<'iQ/(t*,t)pF  converges  to  Bq^x*, t)§0'( tf,t)pF  of 

'fc  k k 

equation  (5.35).  The  fast  switchings  of  u (t)  dependent  on  B,'$  /(t,,t) 

s 4 y<  r 

takes  place  on  the  0(h)  interval  [t  +tn-T,T  +t_J. 

s u s u 

— k rr  * 

Lemma  5 .3 . Let  the  zeroes  of  BQy(x  , t)?^ /(t^, t)pF  of  (5.32)  be 
simple.  Then  there  exists  a h such  that  for  H ^ (0>H  ] the  switching 

★ r k ^ — *£■ 

times  of  u (t)  on  Lt„,t_+T  -Tj  are  0(h)  different  from  these  of  u (t). 
s u u s 

k 

Furthermore  the  minimum  time  Tg  for  the  slow  control  problem  satisfies 

* — * , a. 

T = T + 0(h  ) 
s 


where  T is  the  minimum  time  for  the  reduced  order  problem  and  a > 0 is 
constant. 


■ 
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-ft  -ft 

Proof  of  Lemma  5.3.  The  fact  that  the  zeroes  of  Bq'(x  ,t)SQ'(t£,t)pF 
are  simple  implies  that  the  control  sequence  of 


u*(t)  = - SGNk0/(x*,t)«0'(tJ,t)pF 


(5.47) 


+ 0(p.)  + 0(e)]  t € Ct0,tQ+Ts-T] 


is,  for  p,  small  enough,  the  same  as  that  of  u (t).  Here  the  choice  of  s 

ft 

is  dependent  on  the  reverse  time  exponential  decay  of  § '(t^,t)  and  the 


difference 


vv(tf’t)pF  - v(x  >t)V(tf,t)'pF 


and  clearly  satisfies 


lim  € = 0 . 

p,  "*  0 


The  0(e)  and  0(p.)  terms  in  (5.47)  shift  the  switching  times  by  some 

variations  dependent  on  p,.  Since  these  variations  go  to  zero  as  p.  - 0, 
£ 

they  are  0(p.  ) dependent  where  a > 0. 

Finally  since  a finite  number  of  switchings  are  varying  by  0(p  ) 


and  T = O(p) 


* — * , a. 

T * T + 0(p.  ) 


and  Lemma  5.3  is  proved. 

The  point  here  is  not  the  particular  value  of  a but  rather  that  for 


p,  £ (0,p.  ] it  is  possible,  by  varying  the  switchings  of  u (t)  and  adding 


some  fast  switchings  to  find  the  optimal  control.  This  might  be  done 
either  by  an  iterative  method  such  as  the  one  presented  here  or  by 
switching  sensitivities  C 22 J . 
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From  Lemmas  5.2  and  5.3  it  is  clear  that  u (t)  is  a near-optimal 
control  for  the  slow  control  problem,  in  the  sense  that  it  steers  x(t)  to 
within  0(p.)  of  in  near  minimum  time  T . 

We  now  begin  the  task  of  finding  a near  optimal  control  u (t)  for 

-ft 

the  time-optimal  control  of  (x',z')  to  (x^y, Zp/)/ • Suppose  that  u (t) 

■■  7^ 

is  applied  to  (5.1);  then  z(t^)  will  be  within  some  bounded  region 

■ ijf 

surrounding  z . Since  z(t,)  is  some  finite  distance  from  z and  z is 
Ft  F 

X 

controlled  in  a fast  (— ) time  scale  it  is  possible  to  steer  z from  z(t-) 

r 

to  Zj,  by  some  fast  switchings  of  0(p.)  duration.  To  this  end  we  define 
the  fast  system 


pz  = gCXy.t)  + G(Xp,t)z  + B2(xF,t)u 


(5.48) 


where  x^  is  the  fixed  final  point  for  x.  System  (5.48)  is  linear  time- 

* 

varying  and  we  consider  the  problem  of  finding  the  control  u^(t)  which 

■ ■*j^* 

time-optimally  steers  z from  z(t.)  to  z . Due  to  the  presence  of  p,  in 

t F 

^ ■ i ii 

(5.48)  the  time  Tf  required  for  uf(t)  is  0(p).  Suppose  that  u (t)  is 

1c 

applied  to  (5.1)  to  drive  x to  0(p.)  from  x^,.  Applying  uf(t)  for  an 
additional  0(p.)  time  will  leave  x 0(p)  from  x^,. 

Lemma  5.4.  The  control 


5*(t)  =< 


u*(t) 


u*(t) 


, * — k _* 

t € ltf,tf+Tf) 


(5.49) 


is  near  optimal  in  the  sense  that  it  steers  (x/,zO/  from  (Xq',Zq/)/ 
to  0(p)  from  (Xp/,Zp')'  in  near  minimum  time  T*  + T*  = T*  + 0(pa). 
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Proof  of  Lemma  5.4.  It  has  already  been  shown  that  u (t)  steers  x 
to  0(n)  from  x^.  The  next  step  is  to  show  that  it  steers  z to  0(h) 
from  Zp. 

On  the  interval  tvv-jp 

x(t)  * Xj,  + 0(H)  • (5.50) 

Let  e(t)  * z(t)  - z(t).  Then 

* . 

e * z - z 

■ g(x,t)  - g(Xp»t)  + Cg(x, t)  - G(Xj,t)]z  (5.51) 

+ [B2(x,t)  - B2(xF,t)]u  . 

By  an  analogous  argument  to  that  in  the  proof  of  Lemma  5.3  and  (5.50)  it 
can  be  shown  that  e(t)  is  0(H).  Since  uf(t)  steers  z to  z^,  u (t)  steers 
z to  0(H)  from  zp. 

★ — "k  dL  * 

Finally  T^  ■ 0(h)  and  therefore  T^  + T^  = Tg  + 0(H  )•  The  time  Tg 
for  the  slow  control  problem  satisfies 

T < T 
s — 

and  thus 

— * * , a. 

T + Tf  = T + 0(H  ) • 

This  completes  the  proof  of  Lemma  5.4. 

5.4  Example 

This  example  illustrates  the  result  of  applying  the  near  optimal 
control  u*(t)  to  the  control  of  the  system 
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x ■ - sin  x + ~2  zi  + ~2  Z2 

= - z^  + u (5.52) 

pz2  * " 2z2  + 2u 

where  n- = 0.1.  Let  the  initial  and  final  states  be  (tt/2,  0.7,  -0.8)  and 
(0,0,0)  respectively.  Setting  (j,  = 0 yields  the  reduced  order  system 

x **  - sin  x + u . (5.53) 


The  reduced  time-optimal  control 

u (t)  =*  - 1 t € [0,1) 


(5.54) 


steers  x from  tt/2  to  0.  When  this  control  is  applied  to  (5.52),  (x,z1>z2) 
is  steered  to  (.0679,  -1,  -1).  This  is  acceptable  if  we  are  not 
interested  in  steering  z.  For  instance  if  z represents  actuator  dynamics 
the  final  position  of  the  actuator  may  not  be  important.  But  suppose  we 
are  interested  in  steering  z to  zero.  Then  we  find  the  control 


• if 

uf  (t) 


t € [1,  1.1098) 
t 6 [1.1098,  1.139) 


which  steers  (z^z^  from  (-1,  -1)  to  (0,  0).  When  u (t)  followed  by 
u*(t)  is  applied  to  (5.52)  the  state  (x.z^.z^)  is  steered  to  (0.658,  0,00, 
0,00).  Thus  x is  steered  to  within  57.  of  its  final  state,  and  z is  steered 
to  0.  The  near-optimal  control  is  calculated  entirely  on  reduced  order 


systems . 
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6.  NONLINEAR  EXAMPLES 


6.1  Introduction. 


In  this  chapter  two  nonlinear  examples  are  presented.  One  of  these 
examples  does  not  fit  into  the  form  of  system  (5.1)  and  therefore  the 
theory  developed  in  Chapter  5 is  not  directly  applicable.  However,  due 
to  the  uniform  asymptotic  stability  of  the  homogeneous  part  of  the  fast 
subsystems,  both  of  these  examples  have  the  two  time-scale  property.  The 
purpose  of  this  chapter  is  to  demonstrate  ways  to  treat  particular 
problems  based  on  the  two  time-scale  property. 

In  each  example  the  controls  are  calculated  for  reduced  order  time- 
optimal  problems.  Then  iterative  schemes  are  applied  to  calculate  optima], 
controls.  The  first  example  is  a magnetic  suspension  system  with  both 
state  and  control  constraints.  As  a result  of  the  form  of  the  time- 
optimal  control  for  the  reduced  system,  a singular  arc  in  the  time-optimal 
control  for  the  full  order  system  is  proposed.  The  second  example  is  more 
complicated  and  in  order  to  find  the  time-optimal  control  for  this  system 
a nested  iterative  scheme  is  developed. 

In  the  discussion  of  each  example  an  effort  is  made  to  show  the 
thinking  used  in  the  development  of  the  results.  The  general  approach  is 
always  to  first  find  the  time-optimal  control  for  the  reduced  order 
system.  This  time-optimal  control  steers  the  slow  state  x of  the  full 
order  system  to  a point  which  is  0( p.)  from  its  final  desired  point.  The 
first  iterative  method  is  designed  to  perturb  this  control  in  such  a way 
that  the  resultant  control  steers  x to  its  final  desired  final  point.  Then 
the  control  which  steers  the  state  of  the  fast  subsystem  to  its  final 
desired  state  in  minimum  time  is  found.  This  control  steers  the  state  z 


T'r 


J 


- 
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of  the  full  order  system  to  a point  0(p.)  from  its  final  desired  point. 
Once  again  an  iterative  method  is  applied  to  change  this  control  so  that 
it  steers  z to  its  final  state.  Then  the  intermediate  point  algorithm 
is  applied  to  find  the  time-optimal  control  which  steers  (x",  z7)7  to  its 
final  desired  point. 


y 


6.2  Magnetic  Suspension  System 

The  system  treated  here  is  nonlinear  in  the  state  z.  Thus  this 

system  is  not  of  the  form  of  (5.1).  From  the  two  time-scale  property  it 

is  conjectured  that  the  time-optimal  trajectory  has  a singular  arc.  Also 

a state  constraint  becomes  a control  constraint  in  the  reduced  problem. 

* 

The  problem  is  to  find  the  control  u (t)  which  time-optimally  steers 
the  state  of  the  system 


x]L  = 10x2 


x2  = - 20(z)Vxl  + 10 


(6.1) 


Lz  * - Rz  + l 


from  an  initial  point  (x^q,x2q,Zq)  to  a final  point  (x^.x^.z^,)  subject 
to  the  constraints  u < Umax  and  ] z } < 1.  Also  it  is  assumed  that 
0.1  < x^  < 1 and  |x2  | < 1.  System  (6.1)  is  one  possible  model  of  a 
magnetic  suspension  system  consisting  of  an  electro-magnet  which  is 
suspending  an  iron  ball  beneath  it.  The  states  x^  and  x2  are  the  position 
and  velocity  of  the  ball,  z is  the  current  through  the  magnet  and  u is  the 
voltage  input.  The  parameters  L and  R represent  the  inductance  and 
resistance  of  the  magnet. 

In  attempting  to  control  such  a system  an  engineering  assumption  that 


y 

. •'  be  made  is  that  the  current  z can  be  changed  instantaneously  from 
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one  value  to  another.  This  corresponds  to  setting  L = 0 and  thus  L takes 

the  place  of  p.  in  this  system.  The  parameter  L is  set  to  zero.  Then 
2 

(z)  *■  v,  constrained  0 < v < 1,  can  be  treated  as  control  for  the  reduced 

order  system 


xx  = 10x2 


= - 20v/x^  + 10  . 


(6.2) 


The  reduced  problem  is  to  find  v (t)  which  steers  (x^,x2)  from  the 

initial  point  (x^q,X20^  t0  t*le  Point  ^xif»x2F')  in  minimum  time* 

★ 

The  slow  control  problem  for  (6.1)  is  to  find  the  control  u (t) 

s 

which  time-optimally  steers  (x^,x2)  from  (x1q,x20^  t0  ^X1F*X2F^  w*t*1 

final  value  of  z free.  This  control  is  found  in  two  steps.  First,  a 

•k 

near  optimal  control  is  found  based  on  v (t).  Then  this  control  is 

iteratively  adjusted  to  ug(t).  Finally  the  time-optimal  u (t)  is  found 

via  the  intermediate  point  algorithm.  For  this  system,  the  value  z is 

r 

the  amount  of  current  which  will  hold  the  ball  in  the  position  x^  with 
the  velocity  x2f  = 0.  Thus  xip>x2F’ZF  is  a nom^na^  Point  around  which 
a regulator  can  be  designed. 

The  Hamiltonian  for  the  reduced  control  problem  is 


H = 1 + 10pjX2  - 20p2v/x1  + 10p2  (6.3) 

where  p^  and  p2  are  the  adjoint  variables  corresponding  to  the  states  x^ 
and  x2  and  satisfy 

Pl  « - 20vp2/(x1)2 
?2  = ‘ • 


(6.4) 
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By  the  minimum  principle 

0 

v*(t)  = < 


if  20p2/Xl  < 0 
if  20p2/Xl  > 0 


(6.5) 


and  H * 0 along  the  optimal  trajectory.  It  can  be  shown  that  the  reduced 
problem  is  normal  and  therefore  no  singular  arc  arises.  Thus  v * 0 or 
v * 1.  A phase  plane  study  shows  that  the  possible  control  sequences  in 
the  region  of  interest  are  {l,0},  {0,l},  {o}  and  {l}*  The  reduced  system 
time-optimal  trajectory  from  (x^jc^)  = (0. 3,0,0)  to  (x1F»x2p)  = (0.6, 0.0) 
is  plotted  in  Figure  6.1  and  the  corresponding  time-optimal  control  is 


* 

v (t)  = 


t € [0,  .0658) 
t € [.0658,  .0920)  . 


The  next  step  is  to  find  the  slow  control  u (t).  Suppose  that  the 

s 

control  sequence  which  drives  (x^,x2)  from  (x^q,x2q)  to  (x^.^p)  is 

tvl,v2}.  For  L sufficiently  small  a near  optimal  control  is  the  following: 

2 

1.  Apply  u^  = + Umax  to  time-optimally  steer  z to  z = v^ 

at  t * t^. 

2 

2.  Apply  u2  = R or  0 to  hold  z on  z = v^  until  (x^,x2)  hits 

the  switching  curve  for  the  reduced  system  at  t = t2> 

2 

3.  Apply  u^  = + Umax  to  time-optimally  steer  z to  z = v2 

at  t ■ t.. 

2 

4.  Apply  u.  = R or  0 to  hold  on  z = v.  until  x_  = x0_  at 

4 

t = t.  . 

4 

At  t a * X£p  and  x^  is  0(L)  from  x^  at  the  position  x^R 

(x^  reached)*  For  smaller  L the  distance  |x^-x^  j is  smaller.  The  near 


> 

» 
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optimal  phase  plane  trajectory  for  the  points  (x^x^)  = (0.3, 0.0)  and 

(x^jX^p)  = (0.6, 0.0)  is  plotted  in  Figure  6.2  for  R = 1 and  L = .05. 

In  Figure  6.3  the  corresponding  z trajectory  is  plotted  versus  time. 

Note  that  for  these  values  of  R and  L,  x^  = 0.68. 

The  conjecture  is  that  by  varying  t^,  i = 1,...,4,  with  the  sequence 
ci  * 

(u^.u^.u^.u^ ) the  control  ug(t)  will  be  found.  If  this  is  true  the  time- 
optimal  trajectory  will  contain  a singular  arc  as  either  u^  or  u^  = 0 on 
a finite  interval.  On  this  interval  z is  also  zero.  When  u and  z^  = 0 
the  equation  for  x2(t)  is 

X2(t)  = lOt  + K (6.6) 


where  K is  a constant.  The  minimum  principle  provides  the  necessary 
conditions 


H 


+ pL10x2 


^up_u;  /x. 


(6.7) 


where  the  adjoint  variables  satisfy 

px  = - 20p2(z)2/(X;L)2 
P2  = ' iOpj^ 
q = 40zp2/x1  + Rq/L 

and 

u < 0 if  q > 0 

u > 0 if  q < 0 . 


(6.8) 


(6.9) 


Thus  for  u to  be  0 on  a finite  interval,  q must  be  0 on  that  interval. 
This  implies 


.1.  ... 
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q - o 

*>2=0  (6.10) 
=>  H * 1 + p110x2  + lOp^  ■ 0 


and 


■>  Px  * 0 


P2  * " 10pl 


q » 0 


“>  Pl  = Kl,  P2  = - 10Kxt  + K2 


(6.11) 


where  and  are  constant.  Equations  (6.10)  and  (6.11)  imply 


-l-10p2  lOOKjt  10K2 

X2  = lOpj^  = 10K1  + lOKj  K^~ 


= lOt  + 


-i-iok2 

10K1 


Thus  if  in  (6.6) 


-i-iok2 

10K1 


(6.12) 


(6.13) 


the  singular  arc  satisfies  the  necessary  conditions  of  the  minimum 
principle. 

The  near  optimal  control  steers  (x^xp  from  (x1Q  x2Q)  to  (xlR,x2F). 

There  must  be  a point  (x^G>x2p^  such  that  a near  optimal  control  based  on 
★ — — 

v (t)  steering  (x^x^  to  this  point  would  steer  (x^x^  to  (x1F>x2p’ 

An  iterative  method  for  x^G  is  as  follows: 

1.  Initialize  with  x.  ^ = x1c,. 

1G  IF 


- 
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2.  Find  the  near  optimal  control  u ^n\t)  to 

S 1G  ZF 

Let  the  position  reached  be  x^R^ . 

3 x (D+1)  = x (n)-  afx  (n)  - x ) 

-5.  x1G  x1G  a(>XlR  1FJ* 

4.  If  |xlR(n)  - xlp,|  < 0.01,  stop. 

5.  Let  n *"  n+1  and  go  to  step  2. 

Experimentally  a * 0.6  provides  convergence  within  fifteen  iterations. 
In  this  way  ug(t)  is  found.  In  Figure  6.4  the  phase  plane  trajectory 
corresponding  to  u*(t)  for  (x10’x20^  = (°*3 4 5 6 7 8>°*°)  and  (xif,X2F^  = (0.6, 0.0) 
is  plotted  and  in  Figure  6.5,  the  corresponding  z versus  time  trajectory. 


Suppose  that  ug(t)  may  be  found,  as  above,  to  steer  (x^.x^)  from 

(x^q.x^q)  to  any  intermediate  point  (x^^.x^^).  Then  we  may  apply  the 

* 

following  intermediate  point  algorithm  to  find  u (t)  which  steers 

(x1,x2,z)  from  (X10>*20*Z0^  t0  (X1F’X2F’  V * 

1.  Initialize  with  x^^  = x]_y>x2i^’>  = X2F’  n = 

2. _  Find  ug^(t)  to  steer  (x^,x2)  to  (x^ ,x2^n^)  In  time 


3.  Find  z(T  (n)). 

s 

4.  Find  u^n^  to  steer  z^  from  z^(Tg^U^)  to  x^p  in  time  ^ . 


5.  Find  x1(Tg(n)  + Tf(n)),  x2(Tg(n)  + Tf(n)), 


6.  If  lXl(Tg(n)  +Tf(a))  - xlF!  + |x2(Tg 


+ !x2(Tg(n)  + Tf(n))  - x2p|  < s 


where  e is  a predetermined  allowable  error,  stop. 

7.  Integrate  (6.1)  in  reverse  time  with  u^^  to  find 

(n+1)  (n+1)  . (n)  _ . . , 

x,,  ,x„_  . That  ls.  u.  steers  (x. .x.)  from 

u ' zi  t ' i'  Z' 


(XlI(n+1).x2i(n+1))  t0  (xIF,X2F}' 


8.  Let  n *"  n+1  and  go  to  step  2. 

After  the  algorithm  converges  a control  has  been  found  which  steers 
(x1,x2,z)  from  (x10»x20’2o)  to  (X1F»X2F^  in  time  Tg(n)  + For  che 


problem  of  steering  from  (0.3, 0.0, .387)  to  (0.6, 0.0, .548)  with  L = .05 

and  R ■ 1,  the  intermediate  point  algorithm  converges  to  drive  (x^jX^) 

to  10  ^ from  (xip»xip)-  T^e  times  required  for  this  are  Tg^n^  = .902, 

* .002.  This  is  proposed  to  be  time-optimal  as  is  27.  of 

Tg^n^  and  Tg^  is  77.  larger  than  T the  time  required  for  v (t).  In 

Figure  6.6  the  x^-X2  plus  phase  plane  trajectory  is  plotted  and  in 

Figure  6.7  the  trajectory  z versus  t is  plotted.  For  systems  such  as 

(6.1)  the  smaller  L the  closer  T ^ is  into  T*.  Thus  there 

s t 

exists  iP  such  that  for  L £ (0,L^)  the  control  provided  by  the  intermediate 
point  algorithm  is  optimal. 


6.3  An  Example  of  Nested  Iteratives 


The  system  treated  in  this  example  is 


X1  = X2 


. 1 , , <X2> 
2 = 2 <Vx2)e 


|1Z^  = - z^  + u 


= " 2z2  + 2x2  + 2u 


where  the  control  us  is  constrained 


lul  < 1 


(6.14) 


(6.15) 


and  p.  Is  a small  parameter  greater  than  zero.  The  problem  under  con- 

★ 

sideration  is  that  of  finding  the  control  u (t)  which  time-optimally 
steers  the  state  (x^,-x.^,z^,z0)  from  a fixed  initial  point  (xiq,X20,  z10»  ziq) 


to  the  origin. 


the  intermediate  point  algorithm 
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Setting  p to  zero  yields  the  reduced  order  system 


L 


X1  ” x2 


1 - <v2 


(6.16) 


x2  * 2 Ue 


The  solution  to  the  problem  of  finding  the  control  u (t)  which  time- 
optimally  steers  (x^,x2)  from  (x2q,x20^  t0  the  ori8in»  in  time  T , is  in 
[37].  There  it  is  shown  that  for  (x^qjZ^q)  in  the  region  G, 


- (x  ) 

(x10,x20)  € G = {(x10'x20):  Ko'  < 1 + e 20  5 


(6.17) 


the  optimal  control  exists  and  for  (x^q,x20^  not  *’n  t*le  re8*on  G an 
e-optimal  control  [36]  exists.  For  the  problems  considered  for  (6.14) 
and  (6.16)  it  is  assumed  that  (x^q,x2q)  ^ G. 

Since  the  final  desired  point  is  the  origin,  x„_  = 0,  the  fast 


subsystem  is 


pz^  = - z^  + u 


pz2  - - 2z2  + 2u  . 


(6.18) 


The  near  optimal  control  for  system  (6.14)  is 


u (t) 


t € [0,T  ) 


_*  . , — *■  — * 
uf(t)  t € [T  ,T  +Tf) 


(6.19) 


where  u^(t)  steers  (z^,z2)  from  (z^(T  ),z2(T  ))  to  the  origin  in  minimum 
time  Tf.  For  the  initial  point  (x1Q,x20, z1Q, z2Q)  = (1,1, -1,-1)  and 
p = .4,  the  near  optimal  control  is 


90 


f -1  t 6 [0,315) 


u(t)  = / 


1 


-1 


t 6 [3.15,4.80) 
t € [4.80,5.24) 


1 t 6 [5.24,5.30) 


(6.20) 


\ 

which  steers  (x^,x2> z^, z^)  of  (6.14)  to  (- .29,- .25,0,0,- . 2) . The  phase 
plane  trajectory  of  versus  x2  is  plotted  in  Figure  6.8.  The 
corresponding  trajectories  z^  versus  t and  versus  t are  plotted  in 
Figures  6.9  and  6.10.  For  y.  = .1  and  the  same  initial  point  the  near 
optimal  control 

-1  t € [0,3.15) 

1 t 6 [3.15,4.79) 


u(t)  = < 


-1  t € [4.79,4.90) 

1 t € [4.90,4.93) 


(6.21) 


V 


steers  (x^x^z^z^  to  (-.06, -.07, 0.0, -.056)  and  thus  as  expected  the 
near  optimal  control  steers  the  state  to  a point  closer  to  the  origin. 

In  order  to  improve  on  the  near  optimal  control  we  make  the  con- 
jecture that  the  optimal  control  may  be  found  by  perturbing  the  switching 

★ 

times  of  u(t).  This  is  done  in  two  steps.  Let  ug(t)  be  the  control 
which  time-optimally  steers  (x^>x2^  of  (6.14)  from  (xiq*x20^  to 

«Hp  "if 

origin  in  time  Tg  and  let  u^(t)  be  the  control  which  time-optimally 

steers  (z^z^  from  (z1(Tg) , z2(Tg) ) to  the  origin  in  time  T . 

The  controls  u (t)  and  u,(t)  are  found  by  an  iterative  method  which 
s i 

is  essentially  the  same  as  the  one  described  in  the  magnetic  suspension 
example.  Here  this  method  will  be  described  in  terms  of  finding  the 

"jlf 

control  u (t).  The  fast  control  uf(t)  is  found  analogously. 

S £ 


Figure  6.9.  The  z.  versus  t trajectory  corresponding  to  control 
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There  must  be  a point  (x^G,x2G)  sucb  that  the  control  u (t)  designed 
to  steer  (x.,x2)  from  (x10,x20)  to  (X1G»X2G)  wil1  steer  (Xx> x 2)  to  any 
desired  point  (xlF»x2F)*  Thus  u (c)  for  the  final  point  (X1G»X2G>  ia  io 
fact  u*(t).  Let  (x1r»x2r)  be  the  Point  to  which  u*(t)  steers  (xx,x2)  of 
(6.14).  The  following  is  an  iterative  method  for  (xiG»x2G)  and  us(t). 

1.  Initialize  with  (X1G^°^ »x2g^°^  * ^X1F,X2F^‘ 

2.  Find  the  control  u^t  which  steers  (xx>x2)  to 

(x.._^n\x0/,^n^)  in  minimum  time  T^n\  Let  the  point 
1G  ZG 

reached  by  (x^,x2)  when  u^n\t)  is  applied  to  (6.14) 

ho  (n>  v <a>\ 

be  (x1R  ,x2R  ). 

3 x (n+1)  - x (n)  - a(x  (n)  - x ) 

3.  x1G  x1G  a(-xlR  X1F'  ’ 


(n+1) 


a(x2R  " X2F)# 


4.  If  |x1Rv  - xlp|  + |x2Rv  - x2Fl  < 0.01,  stop. 

5.  Let  n n+1  and  go  to  step  2. 

After  this  method  has  converged  u^(t)  should  be  us(t).  As  in  the 
magnetic  suspension  the  value  a ■ 0.6  provides  convergence  in  less  than 
15  steps  for  the  initial  points  which  were  treated  experimentally. 

In  this  manner  an  improved  near  optimal  control  is  found 


u (t) 


ic 

U <t> 


u*(t)  . 


(6.22) 


For  the  initial  condition  (-1,2, -1,-1)  considered  above  and  p.  * 0.4, 


_* 

u (t) 


-1  t € [0,3.06) 

1 t € [3.06,5.00) 

-1  t € [5.00,5.45) 

1 t € [5.45,5.57) 


(6.23) 
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and  steers  (x^^, z^, Z2)  to  (.032,  .076,0.0,  .009).  This  is  an  improved 
control  over  control  (6.20)  since  it  steers  the  state  of  (6.14)  closer 
to  the  origin. 

The  intermediate  point  algorithm  can  now  be  applied  since  it  is 

•ft  ^ 

possible  to  find  ug(t)  to  steer  (x^,X2)  to  points  (x^^p)  and  uj(t)  to 
steer  to  the  origin.  Thus,  even  though  (6.14)  is  not  block 

diagonalized,  controls  may  be  found  to  steer  the  slow  and  fast  states  to 
desired  points  from  controls  calculated  for  reduced  order  systems. 

After  six  iterations  of  the  intermediate  point  algorithm  the  control 


[ -1  t € [0,3.105) 


U 


(6) 


(t) 


t € [3.105,5.046) 
t 6 [5.046,5.253) 


| 1 t € [5.253,5.304) 


(6.24) 


steers  (x^jX^.z^, z^)  from  the  above  initial  point  to  (.0047, .037,-. 3X10~^, 
.0008)  for  p,  = 2.  The  intermediate  point  algorithm  is  applied  in  nested 
iterations.  That  is,  in  each  iteration  of  the  algorithm  these  are 
iterations  for  ug v '(t)  and  u^  '(t).  For  u (t),  in  Figure  6.11,  the  x^ 
versus  X2  trajectory  is  plotted  and  in  Figures  6.12  and  6.13,  the  z^ 
versus  t and  z^  versus  t trajectories  are  plotted. 

For  this  example  the  intermediate  point  algorithm  does  not  provide 
any  particular  improvement  over  the  near  optimal  control  u (t).  The 
control  is  calculated  as  the  concatenation  of  a time-optimal  control  for 
the  reduced  order  system  and  a time-optimal  control  for  the  fast  subsystem. 
The  calculation  of  the  time-optimal  control  for  the  reduced  order  system 
requires  Runga  Kutta  integration  for  the  switching  times  and  therefore, 
since  the  intermediate  point  algorithm  requires  six  iterations,  it 


Figure  6.12.  The  z.  versus  t trajectory  corresponding  to  control 
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requires  six  times  as  much  integration.  With  this  integration  the  slow 

control  cannot  be  calculated  to  steer  the  x states  to  much  closer  than 
-2 

10  of  the  final  point.  For  the  same  initial  state  and  p = 0.2,  the 
xrx2  trajectory  corresponding  to 


1 t € [0,3.099) 


s*(t)  = < 


1 

-1 


t € [3.099,4.898) 
t € [4.898,5.123) 


j 1 t € [5.123,5.183) 


(6.25) 


is  plotted  in  Figure  6.14.  The  final  error  is  approximately  the  same  as 
that  in  Figure  6.11.  The  trajectory  in  Figure  6.11  is  apt  to  be  closer 
to  the  shape  of  the  optimal  trajectory.  However  u (t)  requires 

less  time  and  is  therefore  a better  control  if  our  criterion  is  that  of 
steering  the  state  close  to  the  final  point.  This  is  a case  where  the 
optimal  control  is  not  necessarily  the  best  control  to  use.  This  is 
particularly  true  since  the  models  for  real  systems  are  not  perfectly 
accurate  anyway. 

Thus  by  using  the  two  time-scale  property  and  the  near  optimal 
controls,  iterative  methods  can  be  developed  in  order  to  steer  the  state 
of  a full  order  system  using  controls  calculated  from  reduced  order 


systems . 
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7.  CONCLUSION 


In  this  thesis  we  first  developed  stability  bounds  for  p in  the 
linear  time-varying  system 


x * A^(t)x  + A12(t)z  + B1(t)u 
* A21(t)x  + A22(t)z  + B2(t)u 


(7.1) 


That  is,  for  p within  these  bounds  (under  approximate,  assumptions  on 

(7.1) )  the  uniform  asymptotic  stability  of  the  full  order  system  and  fast 
subsystem  are  guaranteed.  Then  the  problem  of  steering  the  state  of 

(7.1)  from  a fixed  initial  point  to  a fixed  final  point  in  minimum  time 
was  treated.  The  two  time-scale  property  for  this  problem  was  revealed 
by  expanding  the  necessary  conditions  of  the  minimum  principle  in  the 
singular  perturbation  parameter  p.  In  a similar  manner  the  two  time- 
scale  property  of  the  time-optimal  control  of 


x * f(x,t)  + F(x,t)z  + B^(x,t)u 
pz  = g(x,t)  + G(x, t)z  + B2(x,t)u 


(7.2) 


was  revealed.  This  property  implies  that  the  time-optimal  control  has 
switchings  in  a slow  time-scale  on  an  initial  interval  and  then  switchings 
in  a fast  time-scale  on  the  final  interval. 

On  the  basis  of  the  two  time-scale  property  a near  time-optimal 
control  was  developed  for  (7.1)  and  (7.2).  These  near  optimal  controls 
were  applied  to  the  two  nonlinear  examples  and  some  iterative  methods  were 
developed  to  improve  the  near-optimal  control. 

Thus  the  main  results  were  the  development  of  bounds  for  p in  (7.1), 
the  revelation  of  the  two  time-scale  property  for  (7.1)  and  (7.2),  the 
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development  of  near  optimal  controls  and  the  calculation  of  controls  for 
some  nonlinear  examples. 

There  are  several  possible  directions  for  future  research.  The 
classes  of  singularly  perturbed  systems  which  possess  the  two  time-scale 
property  might  be  extended.  The  solution  of  control  problems  for 
singularly  perturbed  systems  with  state  constraints  requires  further 
study.  Such  a study  would  have  as  its  purpose  the  discovery  of  properties 
of  constrained  control  problems  on  the  basis  of  the  nature  of  the  solution 
for  near  optimal  controls  calculated  for  reduced  order  systems.  An 
example  of  this  is  in  the  magnetic  suspension  system  presented  in  the 
last  section  in  which  a singular  arc  is  proposed  based  on  the  time-optimal 
control  of  the  reduced  order  system.  For  linear  time- invariant  singularly 
perturbed  systems  it  has  been  shown  [29]  that  the  time-optimal  feedback 
control  for  the  full  order  system.  This  result  could  be  extended  to 
linear  time-varying  and  some  classes  of  nonlinear  systems.  Finally, 
research  could  be  made  in  order  to  develop  iterative  methods  designed  to 
take  advantage  of  the  time-scale  decomposition  which  is  characteristic  of 
the  solution  to  other  optimal  control  problems  for  singularly  perturbed 
systems . 
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